AN APPETITIONAL THEORY OF SEXUAL
MOTIVATION 




KENNETH R. HARDY


Brigham Young University 


Following a brief review of evidence on the adequacy of biological 
explanations of sexual motivation, a cognitive-affective model of 
motivation is stated. The model is used to develop a conception of
sexual motivation as an experientially developed appetite. Two consti-
tutional sources of affect are proposed, each of which is sufficient to
reward behavior: the pleasure of genital stimulation or arousal and 
the complex excitement-relaxation of orgasm. The nature of child- 
hood sexuality and the elaboration of sexual appetite in courtship 
are examined. After a discussion of approach-avoidance dynamics, 
problems of sexual control and compatibility are considered. 


The purpose of this paper is to de- 
velop a theory of human sexual moti- 
vation which may more adequately 
account for human behavior than the 
conceptions now prevalent in the psy- 
chological and lay literature. Despite 
much direct and indirect evidence to 
the contrary, some form of biological 
drive theory is dominant today, couched 
in terms of drive reduction, homeostatic 
disequilibrium, endocrine secretion, or 
instinctual libido. 

To be proposed is a conception of 
sexual appetite based upon psychologi- 
cal hedonism or affective change. This 
statement owes much to the thinking 
and research of F. A. Beach, P. T. 
Young, D. C. McClelland, Kurt Lewin, 
D. O. Hebb, and many others. How- 
ever, no previous attempt has been 
made, as far as is known, to systemati- 
cally pursue the implications of affective 
theory to the development of sexual 
appetite. Further, the present author 
must remain responsible for this de-
velopment, as those mentioned may
wish to disown the particular formu- 
lation proposed here. 


INADEQUACY OF ORGANIC 
EXPLANATIONS 


It is beyond the scope of this paper 
to give a comprehensive review of evi- 
dence regarding the adequacy of bio- 
logical needs in accounting for sexual 
motivation. Nevertheless, a brief sum- 
mary of several lines of investigation 
may be stated: 

Phyletic Comparisons. Much of the 
experimental work on sex motivation 
has been done on various species of 
birds and rodents (Beach, 1951). This 
work demonstrates the importance of 
hormonal factors, particularly in the 
females of these species. The tendency 
to extrapolate from these data to hu- 
mans should be strongly tempered by 
several considerations. In the first 
place, there are striking interspecies 
differences. For example, although 
testosterone propionate treatment accel-
erates the rate of maturation of sexual
responses and produces greater than 
normal sexual drive and secretory ac- 
tivity of the accessory sexual organs in 
the rat, it does not do so in the guinea 
pig (Gerall, 1958). Also, studies of 
higher mammals, especially primates, 
show the relaxation of endocrine and 
genetic controls, with experience play- 
ing an increasingly prominent role as 
the human level is approached (Beach, 
1947; Ford & Beach, 1951; Harlow, 
1962; Nissen, 1953). Finally, when 
constitutional or hormonal interpreta- 
tions of sexual motivation are tested 
directly upon humans, they fail to ade- 
quately account for the data, as noted 
in what is to follow. 

Infantile and Childhood Sexuality. 
Cross-cultural data show the extensive 
elaboration of sexual behavior (mastur- 
bation, mutual masturbation, sexual 
games, attempted coitus, coitus itself) 
in cultures which are permissive, en- 
couraging, or demanding of sexual ac- 
tivity prepuberally (Ford & Beach, 
1951; Henry & Henry, 1953). In such 
societies, virtually no secrecy is asso- 
ciated with sex, and little or nothing is 
done to inhibit the sexual activity 
of the youngsters. Ford and Beach 
(1951) report that in a few cultures, 
regular heterosexual intercourse occurs 
from ages 8-10 onward; e.g., among 
the Chewa it is believed that youth will 
not mature unless intercourse begins 
early in life; similarly, the Lepcha be- 
lieve that girls will be infertile unless 
they participate in coitus prepuberally. 

The extent of sexual activity in 
childhood among the peoples mentioned 
above is a powerful demonstration of 
the hypothesis that sexual interest is 
not dependent upon the maturation of 
the sex glands. 

Effects of Castration. When males 
are castrated postpuberally, the effects 
are highly variable, ranging from 
heightened libido to an abrupt cessation 
of sexual motivation (Hamilton, 1943;
Tauber, 1940). Tauber, after review-
ing the literature, concluded that the
attitude of the patient toward the cas- 
tration was a critical factor. 

When females are castrated post- 
puberally, the effects again are variable. 
In Filler and Drezner’s (1944) careful 
study of 41 cases, 36 experienced no 
libidinal change, 3 showed a drop in 
libido, and 2 showed an increase. 

Data are scanty regarding the ef- 
fects of prepuberal castration. Ford 
and Beach (1951) refer to a few cases 
where the male is able to voluntarily 
obtain an erection, copulate, and ex- 
perience orgasm (of course without 
ejaculation); these may be atypical in
our culture, however. Since prepuberal 
castration usually results in underde- 
velopment of the penis, it would bring 
about lessened effectiveness in copula- 
tion, as well as all of the psychological 
implications of masculine inadequacy 
that a small penis may connote to the 
patient. 

Homosexuality. Kinsey (1941) 
convincingly maintains that hormonal
interpretations are inadequate to ac- 
count for the following facts: (a) 
homosexuality and heterosexuality are 
not mutually exclusive, but coexist in 
combinations of every degree of relative 
preponderance; (b) persons may go 
from heterosexual to homosexual rela- 
tionships and vice versa in a matter 
of hours; or alternatively, there may 
occur months or years of an exclusively 
heterosexual or homosexual relation- 
ship, followed by a reversal of pattern; 
(c) thus far, there is an inability to 
distinguish between heterosexuals and 
homosexuals on the basis of hormonal 
secretion or constitutional typologies. 

In a few societies (Siwan, Aranda, 
Keraki, Kiwai), sodomy is virtually 
universal, with those not participating 
considered peculiar. Commonly, these 
relationships begin at puberty, the boy 
taking the feminine role at first, later 
assuming the masculine role (Ford &
Beach, 1951). Such relations coexist,
of course, with heterosexual relations. 
In their study of American men, Kin- 
sey, Pomeroy, and Martin (1948) re-
port that 37% of their sample had had
at least one homosexual experience 
resulting in orgasm, though only 4% 
were exclusively homosexual. 

It is true that most American men 
settle into a pattern of heterosexuality, 
while a small minority establish vari- 
ous degrees of bisexual orientation 
(Kinsey et al., 1948). Some members 
of this minority establish definite homo- 
sexual preferences, with an accompany- 
ing way of life which includes a special 
jargon, mien, and social life among 
those similarly disposed (Cleckley, 
1957; Cory, 1961). Whether one de- 
fines homosexuality as dealing with 
these extreme cases, or subsumes un- 
der this term a more widespread per- 
sonality predisposition, the biological 
interpretation is equally tenuous. 

Hermaphroditism. Data on individ- 
uals with confused or mistaken sex 
identity show variations in sexual de- 
sire, fantasy, and social role to be a 
function of the person’s self-conception 
as to which sex he is (usually this is 
dependent upon parental conceptions), 
and not upon actual sex identity as 
determined by gonadal tissue (Ellis, 
1945; Finesinger, Meigs, & Sulko- 
witch, 1942; Miles, 1942; Seward, 
1946). In Ellis’ exhaustive review of 
the literature, those who were true 
hermaphrodites (possessing both mas- 
culine and feminine gonadal tissue) 
were not significantly different from 
the pseudohermaphrodites in direction 
of sex libido. Of the total group 
studied intensively by Ellis, 80% had 
heterosexual libido, 11% were psycho- 
sexually immature, and 9% were 
homosexual or bisexual in libidinal 
interest, percentages which are prob- 
ably fairly comparable to a “normal” 
group. Furthermore, the only bisex-
uals (those having heterosexual and
homosexual motivation) came from the 
pseudohermaphrodites, the group least 
expected (biologically) to produce 
them. In virtually every case, the sex 
role and libidinal interest were those 
expected of the sex the individual was 
assumed to be while growing up, with 
little correspondence between sex role 
and libidinal interest on the one hand, 
and somatic characteristics on the 
other. 

More recently, Money, Hampson, and 
Hampson (1955a, 1955b, 1957) and 
Money (1961) have reported results 
of comprehensive medical and psycho- 
logical observations on 105 hermaphro- 
ditic patients. Their results strongly 
confirm Ellis’ findings, but are even 
more striking. Only five of their pa- 
tients had a gender role or libidinal 
orientation ambiguous or deviant from 
that expected for the sex to which 
they were assigned and in which they 
were reared. This was true despite 
the fact that these patients had one or 
more of the following physical vari- 
ables of sex which were incongruous 
with the assigned sex: external genital 
morphology, internal accessory repro- 
ductive structures, hormonal sex and 
secondary sex characteristics, gonadal 
sex, and chromosomal sex. Especially 
striking were cases of the same medical 
diagnosis (e.g., hyperadrenocortical fe- 
male, cryptorchid male hypospadiac), 
some of whom had been reared as 
masculine, others as feminine. Those 
raised as masculine were strikingly dif- 
ferent from those raised as feminine in 
gender role and libidinal orientation, 
which followed their assigned sex in 
virtually every case. 

Treatment of Sexual Problems. At- 
tempts to treat impotence and frigidity 
by hormonal means have proved gen- 
erally unsuccessful, when the effects of 
suggestion (the meaning of the treat- 
ment to the patient) are controlled 
(Carmichael, Noonan, & Kenyon, 
1941; Creevy & Rea, 1940; Seward,
1946). Seward (1946, pp. 198-200) 
concludes her discussion of relevant lit- 
erature by maintaining that in virtually 
all cases of frigidity and impotence, the 
difficulty is psychogenic rather than 
organic in character, and that the ef- 
ficacy of psychotherapy as against 
hormonal therapy substantiates this 
diagnosis. 

Periodicity of Sex Desire. Data 
from Davis (1929), Dickinson and 
Beam (1931), Ellis (1936), Hamilton 
(1929), Terman (1938), and Tinklepaugh
(1933) indicate two peaks of sexual
desire in women, one occurring im- 
mediately prior to menstruation, the 
other occurring just following men- 
struation, with desire at a low ebb 
during the period of peak fertility. 
This is exactly opposite to expectation 
based upon hormonal fluctuations. 
Benedek and Rubenstein (1939, 1942), 
on the other hand, report maximal 
heterosexual drive among patients in 
psychoanalysis during the follicular 
phase of maximum fertility. These in- 
vestigators inferred drive levels from 
reports of fantasy obtained during psy- 
choanalytic interviews. Further re- 
search is needed to clarify the differ- 
ences in results, whether due to 
populations studied, relative validity of 
measures used, or other factors. There 
does seem to be general agreement that 
periodic variations in sex drive are 
difficult for many if not most women 
to report. If such monthly fluctuations 
were of major consequence, it seems 
reasonable to assume that they would 
be more fully reflected in behavior or 
consciousness. 

Although it is part of our folklore 
that, following sexual outlet, post- 
puberal males experience a steadily in- 
creasing sex drive which becomes more 
and more insistent, this belief is most 
probably false. In the words of Beach
(1956),


No genuine tissue or biological needs are 
generated by sexual abstinence. It used to 
be believed that prolonged sexual inactivity 
in adulthood resulted in the progressive ac- 
cumulation of secretions within the accessory 
sex glands, and that nerve impulses from 
these distended receptacles gave rise to sex- 
ual urges. Modern evidence negates this 
hypothesis. . . . What is commonly confused 
with a primary drive associated with sexual 
deprivation is in actuality sexual appetite, 
and this has little or no relation to biological 
or physiological needs [p. 4]. 


It seems warranted to conclude that 
the overwhelming proportion of the 
variance in human sexual motivation 
and behavior is not explicable in terms 
of some biological need or tension, how- 
ever conceived. An alternative view is 
explored in this paper, one which uses 
a series of notions based largely on 
cognitive expectancy theory, Lewinian 
field theory, and modern hedonic or 
affective theory. 


PRINCIPLES OF MOTIVATION 


The present theory of sexuality is 
based upon several principles common 
to motives generally, which will be 
stated prior to the specific discussion 
of sex motivation. 


1. Motives are based upon learned 
expectations of an affective (hedonic) 
change (McClelland, 1951, p. 466; 
Young, 1959). These expectations are 
aroused by cues or stimuli which are 
associated with affective states due to 
some previous learning. Such learning 
may be based on the actual experience 
of the individual; however, it may also
be vicariously based on affect-laden
fantasies (imaginal processes) aroused
either by the person’s own thought
processes or through some form of
social influence (e.g., reading a ro-
mantic novel). It is not implied that
imaginal or vicarious experience is 
necessarily as potent a source of moti- 
vation as firsthand, physical experience, 
since the physical experience will or- 
dinarily carry more affect, and may 
have a richer context of cues (Prin-
ciples 4 and 5 below). However,
imagined or vicarious experiences may 
be highly motivating. This is wit-
nessed by common fears of ghosts,
dragons, bogeymen, and haunted houses,
and by behavior following a horror movie
(locking windows, insomnia, night-
mares). Johnson’s (1945) report of
the phantom anesthetist of Mattoon is 
a dramatic illustration. 

2. For such an expectation to mo- 
tivate behavior, the person must be- 
lieve that some action (symbolic or 
overt) on his part helps or is required 
to bring about the desired affective 
change. A motive, then, is an expec- 
tation that the pursuit of a given course 
of action will contribute to, or result in, 
a change in affective state. The ex- 
plicit requirement made here that in- 
strumental behavior is a necessary part 
of a motivational expectation seems to 
go beyond the statement of affective 
theory by McClelland, Atkinson, 
Clark, and Lowell (1953). 

An example may help to clarify this 
point. Suppose a small boy hears the 
bells of an ice cream truck coming 
down the street; his mother buys him 
an ice cream bar; and he thoroughly 
enjoys this tasty treat. There will then 
be a tendency for the bells on subse- 
quent occasions to produce in him 
an expectation of pleasurable affect. 
However, the behavior he is motivated 
to employ is dependent upon the way 
in which the thoughts or actions he 
engages in are conceived as mediating 
between the cues and the affect, or 
serve as affect-producing cues them- 
selves. Thus if the boy had to do 
nothing to receive the ice cream bar, 
he would merely engage in watchful 
waiting upon hearing the bells. If he 
had to coax his mother to purchase 
him a bar, however, the sound of the 
bells would instigate vigorous action 
on his part. 

3. The confidence one places in a
given affective expectation increases
each time it is confirmed through hav- 
ing the affect follow or accompany 
the cues which arouse the expectation. 
Conversely, the lack of confirmation 
(infirmation) of an expectation tends 
to destroy belief in it as the predicted 
affective changes fail to follow the 
associated cues (Bruner, 1951). Fur-
thermore, the extent to which the affect
and cues are uniquely associated de-
termines the degree to which the affect
will be aroused by only such cues and 
the degree to which the cues will arouse 
only such affect. Continuing the ex- 
ample of the boy, if the occasion men- 
tioned is the only time he hears such 
bells and has such an ice cream bar, 
then it can be said that these two 
things have been uniquely associated. 
If he hears such bells subsequently 
and enjoys this tasty treat each time, 
then he will develop great confidence 
(faith) that when the bells are heard 
a pleasurable eating experience will 
follow. The principle of uniqueness is 
similar to the Bruner-Postman concept 
of monopoly (Bruner, 1951). 

4. There is some positive relation- 
ship between the amplitude of the af- 
fective change and the intensity of the 
resulting motive. 

5. The likelihood of arousal of a 
motive is a function of the variety 
of cues associated with the affective 
change, together with the associative 
strength of those cues to the affect 
(Principle 3). 

The last three principles parallel 
rather closely the analysis by McClel- 
land et al. (1953, pp. 68-73) of factors 
affecting motive strength. 


6. A family of expectations is a 
group of expectations which tend to 
arouse one another or to be aroused 
by a similar set of cues. Confirmation 
of one member of a family of expecta- 
tions strengthens belief in other mem- 
bers of that family. Conversely, in-
firmation of one member reduces belief
in other members of the set. The 
amount of generalized confirmation or 
infirmation is a function of the organi- 
zation of relationships between the con- 
firmed member and the other members 
of the family (see Asch, 1952, pp. 
580-582; Rokeach, 1960, ch. 2). 

These statements are cognitive 
equivalents for phenomena often 
phrased in terms of stimulus generali- 
zation, discrimination, and habit family 
hierarchy. 

7. There is a degree of habituation 
in some affective areas in which repe- 
tition or continuance of an activity 
leads to a lesser affective change than 
that which occurred upon initial partici- 
pation in the activity. For example, 
viewing the Grand Canyon of the Colo- 
rado or the Lower Falls of the Yellow- 
stone is more thrilling the first time 
seen than later. Somehow the “edge 
of newness” is rubbed off, perhaps as 
a function of gratification of the cogni- 
tive need to know the environment, or 
of other habituating factors (see Dem- 
ber & Earl, 1957). 

8. Motives can be grossly catego- 
rized into three types: (a) Approach, 
in which the expectations are of a posi- 
tive affective change; (b) Avoidant, in 
which the expectations are of a negative 
affective change; (c) Ambivalent, in 
which both positive and negative ex- 
pectations are aroused by the same 
cues. 

9. The tendency to act on the basis 
of some motive is a function of (a) 
the relative net affective strength of 
the alternatives aroused in a given 
situation; and (b) the subjective prob- 
ability of attainment of the goals in- 
volved (see Atkinson, 1957). The net 
affective strength of a course of action 
is the algebraic sum of the positive and 
negative affectivity associated with it. 
When the difference in Factors a and 
b between two alternatives is great, 
other things equal, the decision is
relatively easy; when the alternatives
are about equal, decision is difficult, 
with vacillation and decision time in- 
creasing (Cartwright, 1940; Cart- 
wright & Festinger, 1943). 

10. Other things equal, the more 
immediate the affective change the 
more attractive or compelling is a given 
alternative as opposed to a change more 
distant; that is, as the psychological 
distance to a goal increases, the less 
motivating it is (Miller, 1951). This 
time-distance gradient is much steeper 
for children than adults, who live in 
a wider time perspective (Lewin, 
1946). The immediacy or delay of 
gratification may be related to the 
subjective probability or certainty of 
attainment. For the hunter, a bird in 
the hand is indeed worth two in the 
bush because he may not get the birds 
in the bush. 

11. The perceived permanence or 
transience of the anticipated affective 
state is relevant in assessing the net 
affect of a given alternative. Stealing 
may bring money for a temporary 
splurge, but may bring lengthy incar- 
ceration, distrust by friends, chronic 
twinges of conscience. Other things 
equal, affective states perceived as be- 
ing relatively permanent will be more 
compelling than those relatively tran- 
sient. Again, the differences between 
children and adults in time perspective 
should be noted. This dimension will 
be less important for children because 
of their relative inability to foresee 
long-term consequences (Lewin, 1948, 
p. 105). 

12. Another pertinent problem in the 
analysis of choice behavior is the prob- 
lem of mutual exclusiveness. Can a 
person take one alternative and also, 
perhaps later, another; can he have his 
cake and eat it too? Obviously, the 
perception of mutual exclusiveness may 
make a decision much more difficult 
than when both goals are seen as 
jointly attainable. 

13. Closely related to the above is
the finality or irrevocability of the 
decision. For example, if marriage is 
conceived of as a permanent bond be- 
tween two people, the decision to marry 
a given person will be a more difficult 
one than if marriage is seen as an ar- 
rangement to be dissolved at the con- 
venience of one or both of the partners. 
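
As a rough illustration of Principle 9 only, the following sketch treats net affective strength as the algebraic sum of positive and negative affectivity and compares two alternatives. The text says only that the tendency to act is "a function of" net affect and subjective probability; the multiplicative combination used here, the numeric scales, and all names (Alternative, tendency, choose) are assumptions introduced for the example, not part of the theory as stated.

```python
from dataclasses import dataclass


@dataclass
class Alternative:
    """One course of action aroused in a situation (hypothetical structure)."""
    name: str
    positive_affect: float   # expected positive affective change (arbitrary units)
    negative_affect: float   # expected negative affective change, as a magnitude
    probability: float       # subjective probability of attaining the goal, 0..1

    def net_affect(self) -> float:
        # Principle 9: algebraic sum of positive and negative affectivity.
        return self.positive_affect - self.negative_affect

    def tendency(self) -> float:
        # ASSUMPTION: net affect and probability are combined by multiplication.
        # The source states only that the tendency is a function of both.
        return self.net_affect() * self.probability


def choose(alternatives):
    """Return the strongest alternative and its margin over the runner-up.

    A small margin corresponds to the difficult, vacillating decisions
    described in the text; a large margin to the easy ones.
    """
    ranked = sorted(alternatives, key=lambda a: a.tendency(), reverse=True)
    best = ranked[0]
    margin = best.tendency() - ranked[1].tendency() if len(ranked) > 1 else float("inf")
    return best, margin


coax = Alternative("coax mother", positive_affect=5.0, negative_affect=1.0, probability=0.5)
wait = Alternative("wait quietly", positive_affect=3.0, negative_affect=0.0, probability=0.5)
best, margin = choose([coax, wait])
print(best.name, margin)
```

The margin is reported alongside the choice because the principle concerns the difficulty of deciding as well as the outcome.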


With the foregoing tentative princi- 
ples stated in general form, attention 
can now be directed specifically to sex- 
ual motivation. 


AFFECTIVE FOUNDATIONS OF 
SEXUALITY 


A theory which assumes that motives 
are learned in association with affective 
experience must, in order to be com- 
plete, describe those conditions which 
produce affective change in the first 
place. McClelland, recognizing this 
problem, has analyzed it with regard 
to taste. One of his generalizations is 
that mild states of stimulation produce 
pleasure (positive affect), intense 
states produce pain (negative affect), 
with the range of intensities involved 
dependent upon the specific sensory 
quality, such as sweet, sour, salty, or 
bitter (McClelland, 1951, pp. 467- 
470). Young (1959) has made a simi- 
lar analysis based upon food prefer- 
ences as related to concentration in 
solution. 

It is postulated that certain condi- 
tions, which innately are affectively 
positive, form the constitutional base 
for the elaboration of sexual appetite:


1. Mild local stimulation of the 
genitals is innately pleasurable. This 
is a matter of common observation by 
parents. The random exploration of 
the body by the infant’s hands soon 
leads to an undue amount of time spent 
stimulating the genital organs unless 
some preventive action is taken. The 
phenomenon is more easily noticed in 
the male child because of the greater
accessibility to stimulation of the penis
than the vulva and because of the ob-
servability of the phenomena of tumes-
cence and erection consequent upon
such stimulation. Adults of the Ati-
melang tribe of Alor, Indonesia, have
capitalized upon the pleasurable char- 
acter of such stimulation; they may 
deliberately stimulate the genitals of 
the child to soothe and quiet him (Kar- 
diner, 1945, p. 131). This also occurs 
among the Hopi, Siriono, and Kazak 
(Ford & Beach, 1951). The pleasure 
accompanying genital stimulation 
(arousal) continues throughout life as 
an affective base for motivational de- 
velopment. Many stimuli, once neu- 
tral, may become associated with it, as 
a consequence functioning as cues lead- 
ing to genital arousal, even in the ab- 
sence of tactual stimulation itself. 

2. The phenomenon of genital cli- 
max, together with its sequel, relaxa- 
tion, is highly pleasurable. It is 
postulated that arousal short of climax 
is pleasurable, though not as gratifying 
as arousal leading to climax (see Ka- 
gan, 1955; Sheffield, Wulff, & Backer, 
1951; Whalen, 1961, for experimental 
evidence in rats). Ford and Beach 
(1951), relying largely on data from 
Dickinson, maintain that masturbation 
in women perseverates among those 
who experience orgasm if no feelings 
of guilt or disgust ensue. While those 
who fail to achieve orgasm tend more 
often to quit, some persist in the 
practice for years without it. 


In men, the pleasures of “girl watch- 
ing,” voyeurism, viewing “cheesecake” 
photography (Clark, 1952; Mussen & 
Scodel, 1955), and reading suggestive 
literature are well known, though par- 
ticipation rarely culminates in orgasm. 
Reference is made to Beach’s (1956) 
tentative analysis of the sexual arousal 
mechanism (SAM) and the intro- 
missive and ejaculatory mechanism 
(IEM). He suggests these as separate 
and sequential mechanisms in the male, 
and further suggests that the IEM has
changed little in the course of evolution 
and is under poor voluntary control, 
but that the unique features of human 
sexuality are consequent upon the great 
lability of the SAM. Both the SAM 
and the IEM are here presumed to be 
highly charged with positive affect. 


CHILDHOOD SEXUALITY


In some societies (e.g., Alorese,
Pilaga of Argentina), masturbation 
develops early and is maintained un- 
abated through the childhood years, 
sometimes being elaborated into mutual 
masturbation and other forms of sex 
play, both homosexual and heterosex- 
ual. These forms of erotic expression 
would seem to be learned through the 
association of the pleasurable affect 
with the tactual stimulation producing 
it. It must be noted, of course, that 
other sources of affect than the erotic 
are frequently involved in motivations 
toward sexual behavior. An examina- 
tion of the data of Henry and Henry 
(1953) on Pilaga Indian children re- 
veals the importance of several addi- 
tional factors. Sex games are used 
to manifest hostility and dominance. 
Also, parental intercourse is openly 
viewed by the children from a very ten- 
der age. Here the child is excluded 
from a social experience of powerful 
affect, and it may well be that in both 
his doll play and interpersonal play, 
the Pilaga child is seeking to capture 
the warmth of inclusion in such an 
affect-laden experience. While in the
present article the primary focus is 
upon the erotic or lustful affect asso- 
ciated with sex, it must be remembered 
that this affective component is nearly 
always integrated with other sources 
of affect in the sexual behavior of hu- 
mans of all ages. 

In many societies, more or less active 
steps are taken to limit or prevent the 
masturbatory efforts of children, and 
to punish other presumed evidences of 
sexual curiosity or interest such as
boy-girl exploration of genitals, public
nudity, use of sex words, etc. The 
Victorian mores of nineteenth century 
Vienna are generally recognized today 
as the basis for Freud’s postulated 
latency period, in which a dearth of 
conscious or overt sexual material oc- 
curs in (approximately) the sixth 
through the twelfth years. In his 
scheme the latency period follows 
closely upon the heels of the “phallic”
stage in which the libido is presumed 
to be localized for the first time in the 
genitalia. This sequence is understand- 
able, for it is precisely the behavioral 
evidences of the phallic stage noted 
above which evoke social sanctions of 
sufficient power to provoke inhibition 
or concealment of sexual expression. 
A wide range of sanctions may be em- 
ployed, far beyond the veiled or ex- 
plicit castration threats deemed to be so 
important by Freud. Verbal judg- 
ments, vague threats, corporal punish- 
ment, shaming, teasing, isolation, denial 
of privileges, and distraction are some of
the devices used (Sears, Maccoby, & 
Levin, 1957). In this way, expressions 
of sexual motivation may be largely 
prevented until the time of life deemed 
culturally appropriate. 

From the present theoretical frame- 
work, the similarity of childhood sex- 
uality to adult sexuality is basically a 
function of the comparative sophistica-
tion about sex based in affective ex-
perience. Without the potent affect
associated with orgasm, however, a lesser
childhood intensity of motivation would
usually be expected. For the child to
desire intercourse (e.g., with the op- 
posite-sex parent), he would have to 
know about intercourse, would have 
to have positive affective expectations 
about it, and would have to perceive the 
person involved as a potential sex ob- 
ject. Among the Pilaga Indians, where 
such conditions seem to pertain, little 
difference in sexual motivation other 
than in intensity might be expected be- 
tween children and adults. But great 
differences would be expected between
the naive, carefully protected United 
States middle-class child and his par- 
ents. To impute to the child the sexual 
desires of the experienced adult is 
entirely unwarranted unless the child 
has the requisite background of affec- 
tive learning. 

Clinical studies by Levy (1937), 
Goldfarb (1944, 1945), Spitz and Wolf 
(1949), and others have suggested 
that extreme social deprivation in in- 
fancy produces severe and lasting dis- 
ruption of affectional (and intellectual) 
processes, though others dispute this 
(Dennis & Najarian, 1957; Pinneau, 
1955). The ingenious studies by Harlow
(1962) and Harlow and Harlow (1962)
of the development of affec-
tional systems in monkeys give promise
of identifying the critical variables (at 
least for monkeys!), though it is too 
early for results to be definitive. In 
the rhesus monkey, social deprivation 
in infancy appears to produce lasting 
inability to exhibit adequate mating 
behavior, but lack of peer relationships 
seems far more critical than the absence 
of the mother-child relationship. It 
should be noted that in Dennis and 
Najarian’s study, the infants had active 
peer relationships after the first 6 
months or so, though they were almost 
totally deprived of relationships with 
adults. Although these children 
seemed somewhat retarded in infancy, 
they appeared to be normal when 
tested at a later age (3-5 years). 

In any event, there seems to be 
mounting evidence that extreme social 
deprivation in early childhood may 
produce pathological effects upon the 
development of “normal” heterosexual- 
ity, supporting the thesis that sexuality 
is in large part a derivative of experi- 
ential factors. 


DEVELOPMENT OF SEXUAL APPETITE 
IN DATING AND COURTSHIP 


In order to take a closer view of the 
development of sexual motivation, let
us consider a boy and a
girl who have entered our culturally
prescribed period of heterosexual activ- 
ities, the dating period. By this time, 
say at age 15, they each have gone
through a long period of largely inci- 
dental learning about such “presexual” 
activities as holding hands, kissing, and 
embracing. Exposure to the mass 
media, “popular” songs, the romantic 
ventures of older siblings, teasing about 
girl friends or boy friends—indeed, 
myriads of influences have given rise 
to the development of a family of re- 
lated expectations that engaging in 
behavior such as holding hands, walk- 
ing arm in arm, kissing, and embracing 
with the “right” (culturally delimited) 
person leads to a positive affective ex- 
perience. These activities are per- 
ceived as related to one another, and 
thus they constitute a family or system. 
In the absence of personal affective 
experience, these expectations are 
not very compelling in motivational 
strength; but they may, under the 
proper conditions, lead to the initiation 
of such behavior. These conditions 
would include the perception of the 
partner as an appropriate object for 
romantic interest, a social situation of 
privacy or intimacy where affectively 
negative consequences resulting from 
kidding, tattling, etc., are minimized, 
and a developing relationship between 
the couple such that the girl is per- 
ceived as providing encouragement to 
the attentions of the boy. 

As the boy becomes more assured 
by the conditions mentioned to expect 
positive consequences from his attempt 
to hold the girl’s hand, the motivations 
leading to such behavior will begin 
to outweigh those restraining such ex- 
pression, and he will initiate the activ- 
ity. Assuming he has appraised the 
situation correctly, his grasp of the 
girl’s hand will be reciprocated by the 
girl. In our society the chances are 
excellent that this contact will produce
a pleasurable emotional state, the oc-
currence of which confirms the pre-
viously held expectation, and also other
members of the set of expectations.

Why does holding hands produce
this emotional response? Obviously, it
is not because of some innate connec- 
tion between receptors in the hand and 
neural centers of affective arousal, 
erotic or otherwise, since equivalent 
stimuli from other persons do not pro- 
duce this response at all. Rather, this 
reaction is produced because of the 
learned symbolic significance of holding 
hands with the appropriate person. 
And there need not be any sexual con- 
notation attached. This special inti- 
macy may gratify a need for psycho- 
logical closeness with someone; or it 
may fulfill needs for status building by 
meriting such a relationship with a 
person favorably placed in the status 
system of adolescent culture. Simply 
put, it is satisfying to be able to hold 
hands with someone perceived as at-
tractive (in the modern slang, “real
cool”), and doubly satisfying if this 
person is generally so regarded by one’s 
age-mate reference group. The im- 
portance of esteem motives in dating 
has been described for an earlier gen- 
eration by Waller (1937), and more 
recently has been graphically portrayed 
by Margaret Mead (1949, pp. 281- 
295). Other motives may be involved, 
of course, but the point is that initial 
participation in hand holding and fur- 
ther romantic activities generally origi- 
nates in nonsexual motives, and further, 
that the positive affect arising from the 
activity serves to strengthen motivation 
for it. 

As the dating period continues, the 
tendency for the couple to engage in 
kissing is fostered by the increased mo- 
tivational strength generalized from 
hand holding, also by the growing 
intimacy of the relationship which al- 
ters the distribution of positive and 
negative expectations, e.g., the partner
permitting or encouraging such a thing.
Assuming that the kiss is attempted
and is mutually acceptable, a positive 
affective state is likely to result, with 
consequences comparable to those dis- 
cussed above for hand holding. If 
the girl’s lipstick tastes good, there may 
be another reward, which Young 
(1952) would call the “palatability 
effect”! 

But an additional development is 
going on at the same period of time. 
The couple is getting more educated 
about things sexual through a more 
sophisticated exposure to the “facts of 
life” via reading, age-mate conversa- 
tions, and a host of other sources. So 
a new set of expectations is established 
about “petting,” sexual intercourse, and 
the like, with affect more or less vividly 
imagined. This new set may have 
only a tenuous relationship with the 
former set initially, a bond which will 
become firmer as the association value 
between the sets increases. 

At this point, application needs to 
be made of the principle of habituation, 
which states that the gratification which 
comes from an initial experience occurs 
with lessened intensity upon subsequent 
occasions. The dulling of the thrill 
with repetition may lead the person or 
couple to try out new activities in order 
to regain the prior level of gratification. 
In the case of the hypothetical couple 
this habituation effect heightens the 
likelihood that they will resort to pro- 
gressively more intense love-making, 
e.g., embracing, “necking,” petting, as 
they continue to engage in amatory 
behavior. 

The confirmation of their affective 
expectations, the increase of their ro- 
mantic appetite with gratification, and 
the dulling of the thrill of any given 
activity with habituation all combine to
move the couple along the path of more 
affect laden experiences. 

Now at some point in this sequence
there will occur a true sexual arousal
(lust). The specifically sexual con-
notations of certain acts, e.g., kissing
the ear, fondling the breasts, indeed, a
host of verbal or gestural cues of affec-
tion or assent, can set off this reaction.
Once this erotic arousal has occurred, 
bringing powerful excitement in its 
wake, the desire to repeat and continue 
the experience is greatly enhanced; 
thus the sexual appetite is increased. 
Another important event also takes 
place, however. A new set of associa- 
tions or meanings is formed. The ac- 
tivities which led up to the initial erotic 
arousal now have a tendency to serve 
as cues leading to the arousal of sexual 
desire. Furthermore, the more erotic experi-
ences are repeated, (a) the greater the
association value of these cues to sex- 
uality, and (b) the wider the range of 
cues to sexuality. Thus, kissing, em- 
bracing, exposure of body parts ordi- 
narily covered, and a host of other acts 
may serve as sexual stimuli to the per- 
son with a history of arousal, while 
to the naive person they have no such 
significance. A 9-year-old, for ex- 
ample, may find intimate scenes in the 
movies boring while they may be highly 
arousing to his experienced elders. 
Summarizing, the positive affect ac- 
companying sexual arousal increases 
the appetite for such experiences; and 
the occurrence of such experiences pro- 
duces an association of the preceding 
stimuli to the sexual affect, thereby 
making these stimuli cues leading to 
the arousal of sexual desire. These de- 
velopments, together with the habitua- 
tion phenomenon, tend to carry the 
participants toward the point where the 
couple may become so excited that it 
is virtually impossible for them to stop 
short of climax. In Beach’s terms, the
IEM may become sufficiently aroused
for the male that it carries itself to com-
pletion with virtually no voluntary con-
trol. This may occur without actual
intromission (petting to climax),
though intercourse will tend to follow


to the behavior discussed above, and 


consistent with appetitive 
though admittedly they are subject to 


behavior, it will be necessary to go 
beyond his study and those of Kinsey 
et al. (1948, 1953) to two kinds of re- 
search: that which is more experi- 
mental in character, providing neces- 
sary control of variables, and that 
which studies individual cases more in- 
tensively on a longitudinal basis. 


PROBLEMS OF APPROACH AND
AVOIDANCE 


To this point, only the positive affect 
has been considered. However, there 
are several sources of negative affec- 
tive expectations surrounding sexual 
behavior which need to be considered 
to give an adequate treatment of sex 
motivation. 

One important source is the guilt 
arising out of moral compunctions 
against various expressions of sexual- 
ity. These arise in varying degree 
among persons in the great bulk of 
societies throughout the globe. A 
second source is the fear of discovery, 
of bringing disgrace to one’s family
and ruination to one’s reputation. For 
the unmarried, a third source, the fear 
of pregnancy, is compounded from the 
guilt and fear already mentioned, to- 
gether with the loss of freedom accom- 
panying a forced marriage. For both 
the married and unmarried other 
anxieties attendant to pregnancy may 
arise: threats to health, the stress of 
psychological and economic adjust-
ments to the baby. A fourth source of
concern is the fear of being “used,” 
with the loss of interest by the partner 
once he has gotten what he wanted. A 
fifth source common to women is the 
disgust and revulsion toward sex pro- 
duced by the doctrine that sex is filthy 
and degrading (French Institute of 
Public Opinion, 1961, pp. 159-163). 

The above list of sources of negative 
affect is not meant to be exhaustive, 
but to lend credence to the proposal 
that one or both partners in a sexual 
relationship may have potent ambiva- 
lent feelings toward it. 

A Lewinian analysis of driving and 
restraining forces (Lewin, 1947) is 
useful in describing the approach-avoid- 
ance dynamics involved in sexual con- 
flict and the management of sex 
appetite. 

In American society, the burden of 
the responsibility for the control of 
sexuality rests upon the female. In 
the dating and courtship period, a girl 
may decide that she will allow a boy 
to go only so far, and then she will 
“draw the line.” Thus, behavior will
occur at a given level because of the 
balance of driving forces with restrain- 
ing forces. This is an unstable equi- 
librium, however, and the device there- 
fore ineffective if love-making is 
habitual up to this point, since the in- 
creased appetite may overbalance the 
restraints, causing a shift in the line. 
The resulting guilt, anxiety, and cogni-
tive dissonance are then assuaged by
rationalization and other available
defenses.

One prevalent manifestation of
ambivalence about sex is the divorce-
ment of sexual qualities and feelings 
from conceptions of the “good” mate. 
Thus a young man may coax a girl 
to grant sexual favors, then lose inter- 
est in her as a prospective wife when 
she does, since he cannot conceive of 
marrying a girl who is promiscuous! 
Because “love is wonderful but sex is
nasty” the girl, too, is likely to perceive
the boy who wants to make love as one 
who is untrustworthy, one who is per- 
haps not interested in her but only in 
what selfish satisfaction he can obtain. 
Consequently, while the couple is verb- 
ally reassuring one another of their 
affection while they make love, they 
actually experience mounting doubts 
about the genuineness of their devotion 
as the frequency and intensity of the 
love-making increase. Both before 
and after marriage the enjoyment of 
sex may be colored by a loss of respect 
for the partner as one who is no longer 
pure and sweet. This occurs because 
the good person, shorn of sexual qual- 
ity, is overidealized. When erotic 
characteristics are discovered in that
person, disillusionment results. 

The very personal gratification which 
accompanies sexual activity lies at the 
heart of the enigma of love and sex, 
since it raises the eternal question: Is 
the other person making love to obtain 
selfish satisfaction or out of a more 
inclusive feeling for me and an altruis- 
tic concern for my happiness? The 
dissociation of love and sex just dis- 
cussed is one attempt to resolve this 
problem. Another is the doctrine that 
love and sex are virtually synonymous. 
While this theory typifies much of the 
wishful content of the mass media to- 
day, physical (sexual) attraction is 
probably not convincing to the lovers 
themselves as a criterion of love. 

Reiss (1960), in his analysis of 
American premarital sex standards, 
suggests that there is a generally rec- 
ognized distinction between sex for 
sex’ sake (body centered coitus) and 
sex for love’s sake (person centered 
coitus). Most American women re- 
serve intercourse for lovers, as an af- 
fectionate expression of love. The 
quandary which exists is not merely 
whether the woman “really” loves the 
man, but whether the man really loves 
her. If the man is sexually opportun-
istic, aggressive, or demanding—and a
wealth of data (e.g., Ehrmann, 1959; 
Kinsey, Pomeroy, & Martin, 1948; 
Reiss, 1960) suggests that the majority 
of American men are—then the woman 
is left to seek for nonsexual assurances 
that he really does love her. The 
more insistent he is about sexual rela- 
tions, the more unsure of his love she 
becomes. If they are not married, the 
uncertainty is magnified, because of 
the lack of marital responsibilities as- 
sumed by the lover. 

A specialized consequence of ambiv- 
alence about sex is the development of 
fetishism or other bizarre aberrations. 
The case described by Grant (1953) 
may be chosen as an illustration of 
this, though his data are insufficient 
to make any analysis certain. In such 
cases, a dissociation occurs between 
one’s erotic desires and more legitimate, 
upright impulses, and a corresponding 
split develops between sexual and lofty 
qualities perceived in others. Thus, in 
Grant’s case, the patient in earlier 
years could have sexual escapades with 
a woman of easy virtue and a platonic 
relationship with a good woman toward 
whom no erotic interest could be toler- 
ated; and “never the twain shall meet.” 
The patient’s resolution of his problem 
by developing a fetish about shoes, 
legs, and leg movements may be inter- 
preted as an attempt to control deep- 
lying feelings of guilt about licentious 
relationships with women. In view of 
this, his final solution, which consisted 
of viewing art films emphasizing leg 
displays, seems rather clever because 
no one else is involved, guilt is thereby 
minimized, and his own risk of ex- 
posure is virtually eliminated. 

Cases of sexual assault, exhibition- 
ism, and the like may result from the 
eruption of sexual desire through the 
barriers imposed by morality and 
reality testing. The development of 
such a breakthrough would be ex- 
plained in the following fashion. The
person engages in a period of lustful
fantasy, in many cases stimulated by 
romantic or pornographic material. 
The excitement which accompanies 
this activity is sufficiently rewarding to 
increase the appetite. If such stimula- 
tion occurs over a protracted period, a 
greatly aggravated appetite may ensue. 
If socially acceptable channels of ex- 
pression are denied the person, he may 
more or less planfully enter a social 
setting of erotic outlet where the 
chances for detection and capture are 
reduced. Thus the avoidance forces 
deterring him from such an act, re- 
duced by lowered expectations of nega- 
tive consequences, are overwhelmed by 
the mounting approach tendencies, 
which increase as the lascivious long- 
ing continues. 


VALUATIVE APPLICATIONS OF 
THE THEORY 


A brief attempt will now be made to 
apply appetitional theory to problems 
of the personal and social management 
of sexuality, as follows: premarital re- 
lations, masturbation, control of sexual 
appetite, and sexual compatibility in 
marriage. 


Premarital Relations 


Some observers have argued that 
American premarital sex mores should 
be made more lenient. Their argu- 
ment is based upon evidence that there 
is widespread violation of these stand- 
ards. Such a position assumes that 
the standards are more flexible than 
the behavior; therefore, the standards 
should be altered to conform to the 
realities of conduct. If one assumes 
that libido is a function of biological 
sex maturation, therefore subject to 
limited control, then since American 
culture tends to delay marriage for a 
number of years beyond sexual matu- 
rity, it follows that the violation of 
standards of premarital chastity is 
virtually inevitable.

However, if one assumes that bio-
logical maturation is only a minor
factor in the development of sexual 
desire, and that sex appetite is subject 
to a tremendous degree of control, 
then other conclusions are permitted. 
Furthermore, if there are negative 
personal-social consequences of pre- 
marital indulgence which outweigh 
the positive consequences, as suggested 
by the Judeo-Christian moral code, 
then one might conclude that the be- 
havior should be altered rather than 
the standards. 

If such a conclusion be entertained, 
the question then follows: On what 
does the behavior depend, and how 
may it be altered? Based on the 
above analysis of the development of 
sexual appetite, several factors can be 
identified as contributive to the growth 
of libido: (a) Early dating where pre- 
sexual behavior is permitted and even 
encouraged. (b) “Going steady” early 
in the courtship years, which tends to 
increase the likelihood of the next fac- 
tor. (c) A dating pattern which is 
characterized by chronic love-making. 
(d) The bombardment of adolescents 
with sexually stimulating materials of 
many kinds, such as suggestive or ob- 
scene literature, sexy motion pictures, 
etc. It might be noted that the drive- 
in theater is called a “passion pit” not 
because of climatological factors which 
generate gonadal secretions, but be- 
cause the lustful pictures frequently 
shown stimulate and legitimize sexual 
expression, and the darkened privacy 
of the automobile lowers other barriers 
to such behavior. (e) The lowering 
of moral standards which serve to re- 
strain immediate gratification of im- 
pulse. Emphasis upon and idealization 
of glamor, physical attractiveness, and 
fun tend to produce a decreased con- 
cern with other values which depend 
upon long-term action. The absence 
of moral training in a setting of sex
sophistication leaves a crucial void in
restraining forces. 


Masturbation 


The use of self-stimulation as a 
source of erotic pleasure is reported to 
be very widespread among American 
adolescent males, and also to be com- 
mon among females (Kinsey, Pom- 
eroy, & Martin, 1948; Kinsey, Pom- 
eroy, Martin, & Gebhard, 1953). 
The recognition of this has led to a
curious reversal of a former position.
Whereas previous generations were
warned not to masturbate because they 
might thereby become insane or abnor- 
mal, many argue today that those who 
fail to indulge in this practice must 
somehow be unbalanced or abnormal. 
The limitations of such a statistical 
conception of normality are too well 
known to require comment. 

The present generation would seem 
to be “victims” of the misconception 
that periodic sexual outlet is bio- 
logically required. This notion is very 
handy, and is readily linked to the 
feeling that one might as well enjoy it, 
controlling evocation of the impulse so 
as to maximize pleasure. 

Needless to say, where moral stand- 
ards exist which forbid such grati- 
fication, approach-avoidance conflict 
exists. As in the case of premarital 
intercourse, conflict can be resolved by 
changing standards. However, it may 
be preferable to alter the behavior to 
fit the standards. 


Control of Sexual Appetite 


A device sometimes suggested to 
control sexuality is to reduce tension 
by displacement or sublimation, i.e., to 
relieve sexual drive by reading, danc- 
ing, painting, attending the theater, etc. 
According to appetitional theory, if 
the substitute activity is sexually stim- 
ulating, then the appetite will tend to 
increase, and the problem of control is 
worsened. If the activity is nonsexual
in character, sexual impulses will be
neither relieved nor aroused. If a person
finds enough such activities which are
satisfying because of nonsexual rea-
sons, his need for sexual gratification
will be lessened, not because of sub-
limation of libido, but simply because
the role of sexual gratification in the
total motivational economy of the per-
sonality will be relatively minor. In
contrast to Freud's view that much 
human activity gratifies a sublimated 
libido, it seems probable that sex is 
often indulged in as a substitute for 
a reasonably satisfying life in other 
areas. 

Appetitional theory suggests a set of 
recommendations for the control of 
lust during the premarital period. 


1. Participation in activities which 
fill one’s life with abundant gratifica- 
tions, so that sexual intimacies are not 
indulged in as an escape from bore- 
dom or emotional starvation. Athletic, 
intellectual, altruistic, occupational, 
avocational, esthetic, or social pursuits 
may serve this function. 

2. Comprehension of the nature of 
sex motivation and the factors which 
govern it, to help the person to fore- 
see the consequences of alternative 
courses of action, thus enabling him 
to make a knowledgeable decision in 
terms of his own motives and values. 

3. Prevent the appetite from grow- 
ing by abstinence from arousing activ- 
ities until the time acceptable for 
sexual expression. This implies a 
dating-courtship pattern which is cen- 
tered upon a wide variety of nonsexual 
activities which additionally might con- 
tribute to a more adequate prediction 
of marital compatibility. (See Leuba, 
1954, for a related set of recommen- 
dations.) 

4. Exposure to entertainment, read- 
ing material, companionship, etc., 
which have nonsexual connotations of
intrinsic interest to the person. This
will tend to prevent the recurrent
arousal of sexuality. 

5. Emphasis upon the long-term 
rewards of continence, and idealiza- 
tion of the ability to delay immediate 
gratification of impulse. 

The implication should not be 
drawn from the preceding suggestions 
that an overprotective, authoritarian 
control of behavior is favored. The 
emphasis recommended is on self- 
selection of situations and activities. 
Of course, in a society as exploitative 
as America, some degree of social con- 
trol of blatant obscenity is probably 
required. 

Sexual Compatibility in Marriage 

The problem of the role of sexual 
compatibility in marriage is another
area of application of the theory. It
is sometimes maintained that sexual 
compatibility is of central importance 
to a successful marriage. Based upon 
this argument is the notion that a 
couple needs to have intercourse before 
marriage to find out if they are 
compatible. 

Appetitional theory supports a con- 
trasting view. Rather than conceiving 
of a person’s sexual drive as a rela- 
tively fixed quantity which can only 
be compared with that of another indi- 
vidual, the present theory maintains 
that the appetite is a variable quantity 
which might be adjusted over a period 
of time to match that of another. 
Thus a couple whose initial appetites 
are at variance may work toward a 
mutual adjustment of these until 
greater harmony exists. Their wil- 
lingness to do this probably depends 
upon their nonsexual love for one 
another. 


SUMMARY 


1. An appetitional theory of human 
sexual motivation is proposed as a
needed alternative to theories based
upon sex as a biological drive or in-
stinct in view of the mounting evi-
dence against the adequacy of physio-
logical interpretations.

2. The theory is based upon a gen- 
eral motivational orientation which 
maintains (a) the affective conse- 
quences of behavior are central in the 
development of motives, through the 
mediation of associations between cues 
and affect. These cues, perceptual or 
cognitive, tend to arouse expectations 
that a given pattern of action will pro- 
duce a certain affective change. For 
an expectation to be motivational in 
character, it must involve not only 
affective change, but also a course of 
behavior, symbolic or overt, which is 
instrumental to attaining the desired 
affect; (b) motives are strengthened 
in proportion to the degree of affective 
change experienced; (c) the likelihood 
of arousal of a motive is a function of 
the likelihood of occurrence of cues 
to the affective change and the associa- 
tive strength of the cues to the affect; 
(d) all members of a family or set of 
expectations are reinforced by the 
confirmation of one member of the 
set along a gradient of generalization; 
(e) a habituation process occurs in 
many affective areas wherein the repe- 
tition of a given activity produces a 
diminished affective change in com- 
parison with the initial experience; (f) 
approach motives are based upon posi- 
tive affective expectations, avoidance 
motives upon negative affective expec- 
tations; however, most motivated be- 
havior is ambivalent, based upon a 
mixture of positive and negative com- 
ponents; (g) of a number of expecta- 
tions aroused at any time, the one 
acted upon depends on the relative 
positive affective change anticipated 
over a period of time, and the subjec- 
tive probability of attainment of the 
alternatives. 

3. The affective foundations for 
genital sex motivation (lust) are (a)
the pleasurable excitement of arousal,
and (b) the intense pleasure accom- 
panying orgasm and the relaxation 
which follows. Either of these 
sources of affect is adequate to 
strengthen sexual appetite. It is rec- 
ognized that there are many nonlustful 
motives which contribute to participa- 
tion in sexual behavior. 

4. Childhood sexuality is based pri- 
marily upon the first affective founda- 
tion, and becomes more potent with 
indulgence; however, in many societies
the precautions taken to prevent
arousal together with various threats 
of negative consequences are usually 
adequate to keep such behavior sub- 
dued to a rather inconsequential level. 

5. The burgeoning of sexual mo- 
tivation usually occurs during the cul- 
turally prescribed period of courtship. 
Middle-class American society is used 
as an illustration. The person or 
couple engages in amatory adventures 
of increasing intensity due to the effect 
of reinforcement, generalization of re- 
inforcement, and habituation. At 
some point in this sequence true genital 
arousal (lust) occurs, its affective 
power greatly heightening the motiva- 
tional level. Chronic participation in 
arousing activity augments the appe- 
tite, which overwhelms even strong re- 
sistances composed of moral compunc- 
tions and various fears, eventually 
leading to intercourse. 

6. The theory is applied to prob- 
lems in the management of sexual 
appetite in premarital courtship and 
masturbation and to the role of sexual 
compatibility in marriage.


It is proposed that figural aftereffects are produced by the same
neurophysical mechanism as are simultaneous and successive contrast.
Observations on lateral inhibition in the eye of Limulus show that
after an ommatidium has ceased to be more highly illuminated than
its neighbors, it undergoes a postexcitatory depression and its
neighbors a postinhibitory rebound. If in addition we assume that
the apparent distance between 2 points is measured in the nervous
system by summing the excitability of the elements between them,
then distortions of phenomenal space will occur around previously
excited points. These are seen to correspond to those observed in
figural aftereffects, e.g., in showing a distance paradox. Various
consequences are discussed.


Writing of their elegant theory of 
inhibitory interaction in the Limulus 
eye, Hartline, Ratliff, and Miller 
(1961) state, 


The value of such a theoretical formulation 
lies in the compactness with which a num- 
ber of experimental observations are de- 
scribed, and the insight it gives into the 
understanding of relationships that, while 
inherently simple, sometimes yield phe-
nomena of considerable complexity. Ap-
plied with caution, it may be useful as a
basis for hypothetical explanations of such 
phenomena as brightness contrast and sensi- 
tivity to moving patterns in human vision. 


In this paper, it is hoped to show that 
figural aftereffects in the human visual 
system stem from neural properties 
qualitatively similar to those found
in the eye of Limulus. Systems show- 
ing inhibitory interactions like those 
found in the Limulus eye have been 
described by Kuffler (1953), Hubel
and Wiesel (1959), and Baumgartner 
(1961) in the visual system of the 
cat, though here such systems show
features of added complexity. As the
present hypothesis assumes only the 
inhibitory interactions found in the 
Limulus eye in explaining a phe- 
nomenon in human vision, it may well 
turn out to be oversimplified, though 
a theory any more elaborate would 
hardly be justified in our relatively 
primitive state of knowledge. 


THE HYPOTHESIS


The basic assumptions of the pres- 
ent hypothesis are mainly those which 
already have been observed in many
sensory systems (see review by
Brooks, 1958). It is assumed that
an excited element inhibits other less
excited elements, the degree of such 
inhibition falling off with distance. 
Furthermore, it is assumed that when 
the excited element ceases to be ex- 
cited, its excitability will transiently 
diminish below its previous baseline, 
and as a result, the elements which
were previously inhibited by its ac-
tivity will now undergo a correspond-
ing transient increase in excitability.
Such a relation has been shown in the 
eye of Limulus and is illustrated in 
Figure 1 (Ratliff, 1961).

Fig. 1. Two adjacent ommatidia in the
eye of Limulus and their inhibitory inter- 
action. (One [black filled circles] was 
illuminated steadily throughout the entire 
period on the graph. The other om- 
matidium [open circles] was illuminated 
steadily until the point marked O on the 
graph. A sudden increment of illumina- 
tion, remaining steady for two seconds, then 
occurred. At the end of the two seconds, 
its illumination was abruptly decreased to 
its original level. Of special interest here 
is the transient increase in frequency over 
the original steady state of the first om- 
matidium [black filled circles] which lags 
the transient depression in activity of the 
second ommatidium when its increment of 
illumination is abruptly turned off.) 
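
The two assumptions stated above can be illustrated with a small numerical sketch. It is not taken from the paper: the function name `aftereffect_profile`, the exponential fall-off with distance, and all parameter values are assumptions chosen only to show the qualitative pattern, namely a transient depression at the formerly excited element and a rebound in its neighbors that weakens with distance.

```python
import math


def aftereffect_profile(n_elements, locus, baseline=1.0,
                        depression=0.4, rebound=0.3, spread=2.0):
    """Excitability of each element just after the element at `locus`
    stops being excited.

    Assumptions (illustrative, not from the source):
      * the formerly excited element drops below baseline by `depression`;
      * every other element rises above baseline by an amount that decays
        exponentially with its distance from `locus`.
    """
    profile = []
    for i in range(n_elements):
        distance = abs(i - locus)
        if distance == 0:
            # Postexcitatory depression at the previously excited element.
            profile.append(baseline - depression)
        else:
            # Postinhibitory rebound, weaker for more distant elements.
            profile.append(baseline + rebound * math.exp(-distance / spread))
    return profile


if __name__ == "__main__":
    for index, value in enumerate(aftereffect_profile(n_elements=11, locus=5)):
        print(index, round(value, 3))
```

Any monotonically decreasing fall-off would serve equally well here; the exponential form is used only because it is simple to write down.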


These assumptions were in their 
general form proposed almost a cen- 
tury ago to explain simultaneous and 
successive contrast in human vision 
(Mach, 1865), and in more quanti- 
tative form yielded interesting deriva- 
tions of such phenomena as “Mach 
patterns” (Ratliff & Hartline, 1959), 
illusions “seen” by Limulus, as well 
as perceived by man. An attempt 
has also been made to explain cer- 
tain streaming phenomena on a simi- 
lar basis (Taylor, 1958). In this 
paper, an explanation is proposed of 
figural aftereffects which also uses the 
above assumptions. Figural afteref- 
fects first were described by Gibson 
(1933) and further investigated by 
Köhler and Wallach (1944). The
whole subject has been well reviewed
by McEwen (1958). In this illusion,
contours that are placed close in the 
visual field to the locus of a previously 
exposed contour tend to be phe- 
nomenally shifted away from this 
locus. The extent of this shift is 
largest not when the two contours,
the one that was originally fixated
(the inspection figure) and the one
which subsequently was substituted
(the test figure), are closest together;
instead, the maximum effect of dis- 
placement is secured when the two 
contours occupy loci at a small inter- 
mediate distance from each other. 
(This is called the “distance para- 
dox.”) Not only does continuous 
fixation of the inspection contour give 
rise to a displacement of the second 
or test contour, but paling and re- 
ceding from the observer are also re- 
ported. How far such a phenomenal 
shift away from the observer is due 
to paling is difficult to establish. 

In order to explain such apparent 
changes in visual distance and relate 
them to the above assumptions con- 
cerning contrast, we must make one 
further hypothesis. We shall assume 
that distances between points are meas- 
ured in the nervous system by taking 
some kind of index of excitability of 
the points lying between them. Lower- 
ing the excitability of the intervening 
points will lead to an overestimation 
of the distance, and increasing the 
excitability of such points will lead 
to an underestimate. This concept 
may be clarified by use of an every- 
day analogy. It is convenient when 
traveling, say, in an airplane, to esti- 
mate distance by the time that it 
takes one to travel between two points. 
However, local conditions such as 
headwinds or tailwinds (to correspond 
to states of lessened and heightened 
excitability) occurring over portions 
of our journey, may lead to overall 
overestimates or underestimates of the 
total distance. As the precise mechanism
by which the sum of excitabilities
of the points between two loci is 
measured is immaterial to our predic- 
tion of figural aftereffects, we need not 
embark on a discussion of such pos- 
sible mechanisms here, but postpone 
it to a later section.

Taking then our further assumption 
and combining it with those regarding 
neurophysiological contrast, if we 
place two points in a region close to a 
previously excited locus, some inter- 
esting consequences emerge. If the 
two points are placed so that the whole 
of the space between them is an area 
previously inhibited by the formerly 
excited locus, then they will seem 
closer together. The previously in- 
hibited area will now be temporarily 
more excitable and so distances meas- 
ured across it will be underestimated. 
On the other hand, if the two points 
are placed one each side of the pre- 
viously excited locus, close to its edge, 
then they will seem to be farther apart. 
The previously excited area will now 
be temporarily less excitable and so 
distances measured across it will be 
overestimated. How far each side 
of the previously excited locus we 
must go before underestimation due 
to enhanced excitability counteracts
the overestimation caused by lessened 
activity, is difficult to predict, because 
of the statistical distribution of the 
initially excited locus due to small eye 
movements during fixation (Ratliff & 
Riggs, 1950). It can be seen by ref- 
erence to Figure 2 how the hypothesis 
predicts further properties of figural 
aftereffects, such as the distance para- 
dox. Here the dotted line X repre- 
sents the deviation from the normal 
excitability which would be expected 
from the assumptions about neural 
lateral inhibition and its rebound.
Depression of excitability occurs
maximally at the locus which had been
excited by the point of a contour of the
inspection figure.

Fig. 2. Alterations in excitability of
receptive layer due to the previous presence
of inspection point at I. (Y indicates
normal level of excitability; X the
deviation from it. Alterations in apparent
distance are computed by the area under the
curve X lying between two test points,
such as T1, T2, T3, and T4.)
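
The rule given in the caption of Figure 2, that the change in apparent distance is the area under curve X between two test points, can be sketched numerically. The Python fragment below is only an illustration under stated assumptions: the shape of the deviation curve (a narrow depression at I flanked by a broader zone of rebound), every constant, and the function names are hypothetical and are not taken from the text.

```python
import math

def deviation(x, depth=1.0, narrow=0.5, gain=0.4, wide=2.0):
    """Hypothetical curve X: deviation of excitability from normal at distance x from I.

    Negative near the inspection locus (x = 0), positive in the flanking
    zone that was laterally inhibited and has since rebounded.
    """
    depression = depth * math.exp(-(x / narrow) ** 2)
    rebound = gain * math.exp(-(x / wide) ** 2)
    return rebound - depression

def area(a, b, steps=2000):
    """Area under curve X between test points a and b (trapezoidal rule)."""
    h = (b - a) / steps
    total = 0.5 * (deviation(a) + deviation(b))
    for i in range(1, steps):
        total += deviation(a + i * h)
    return total * h

def apparent_distance(a, b, k=1.0):
    """Real distance minus k times the area: extra excitability shrinks distance."""
    return (b - a) - k * area(a, b)

# Two test points straddling I appear farther apart than their real 0.6 units ...
print(apparent_distance(-0.3, 0.3))
# ... while a pair lying wholly in the rebound zone appears closer together.
print(apparent_distance(1.0, 1.6))
```

With these assumed constants, a pair of fixed separation placed on one side of I shrinks most at an intermediate distance from I rather than nearest to it, which is the pattern the text calls the distance paradox.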

If contours are placed at T1, T2, and
T3, since excitability in the regions
between them is increased, apparent
distance between them is decreased.

Further, the apparent shift of T1
and T2 toward each other will be
larger than the shift of T2 toward T3,
as the total amount of increase of
excitability is greater between T1 and
T2 than between T2 and T3. Hence,
the largest shift does not occur 
nearest to the inspection (I) contour, 
but some distance away from it. 
In this way, the theory predicts 
the distance paradox first reported by 
Köhler and Wallach (1944). If two 
contours are placed each at Ta and Ta, 
one each side of the previous inspec- 
tion (I) contour, they will appear 
further apart since excitability in the 
region between them is decreased. The 
total absolute displacement of these 
contours will be larger than if they 
had been placed closer to the inspec- 
tion (I) contour, thus yielding a dis- 
tance paradox for this case also. How- 
ever, it should be clear that the point 
at which displacement is maximal dif- 
fers on this theory for the case where 
the test (T) contours are both placed
on one side of the previous inspection
(I) contour and for the case where
the test (T) contours are placed one
each side of the previous inspection
(I) contour. There is a further
difference to be expected in these two
cases. The total absolute displace- 
ment measured as an increment over 
real distance for the case where the 
test contours are placed one each 
side of the inspection contour is greater 
than if they were placed closer to 
the locus of the previous inspection 
figure. However, if they were placed 
closer, their proportional displacement, 
relative to their real distance from 
each other, would be larger. This is 
not true of the two test contours, both
placed on one side of the inspection
contour. Here the proportionate shift
(proportion by which space appears 
to shrink) will increase with distance 
from the inspection contour up to a 
certain point. Furthermore, both the 
absolute and percentage amount of 
shift ought to decline as we move one 
of the points of a pair on each side 
of the inspection figure further away 
from the other point. In fact, given 
an approximately equal influence of 
the increase in excitability and its de- 
crease, we might expect any shift 
due to figural aftereffects to disap- 
pear completely as we moved both 
inspection points away from the locus 
of the test figure. On the other hand, 
as long as one of the points of a pair 
both on the same side of an inspection 
figure is fixed in an area of heightened 
excitability, the absolute, though not 
percentage, apparent decrement in dis- 
tance from its partner should not de- 
crease. (However, both the percent- 
age and absolute decrement in dis- 
tance may increase in this case if the 
increase in distance is taking in terri- 
tory with a heightened excitability.) 
As Fox’s (1951) and Heinemann 


and Marill’s work (1954) suggests, 
figural aftereffects can occur over 
large distances. On the present 
theory, we should expect to find a 
diminution of distance between two 
test points both on the same side of 
a test contour as long as one of them 
was some way within the territory 
previously covered by lateral inhibi- 
tion from the test contour, but not if 
it was outside it. Some measurements 
of this type already have been undertaken
(e.g., Bevan, 1951; Craik &
Zangwill, 1939; Fry & Bartley, 
1935). None has directly compared 
the distance over which figural after- 
effects can occur with the extent of 
lateral inhibition or contrast within 
the same experiment. However, the 
distance over which thresholds are af- 
fected by the presence of previously 
present contours was found by Bevan 
(1951) to be 6 degrees and this is 
probably not the maximum figure. 
Therefore, we could on the present 
theory expect detectable figural after- 
effects over considerable angular dis- 
tances. 

Hebb (1949) has pointed out that 
two test lines parallel to each other 
and at right angles to the inspection 
contour are seen closer to each other, 
whereas on the Köhler-Wallach (1944) 
theory, they should appear further 
apart. 

On Osgood and Heyer’s (1951) 
theory (discussed in greater detail 
elsewhere, Deutsch, 1956) the distance 
between them should suffer no dis- 
tortion. The present theory would 
indeed predict that the two lines should 
be seen closer together, because they 
lie in a zone of enhanced excitability. 

Köhler (1951) states that figural
aftereffects seem to grow during the
first few moments of the test period. 
That this should be the case is evi- 
dent from the plot of the onset of 
increase of activity in a Limulus
receptor unit after an adjacent unit has
ceased to be stimulated. There is a
lag of over .25 second before increase
of activity reaches a maximum.
Krauskopf (1954), who attempted to 
investigate the statement of Köhler’s 
quoted above, and with negative re- 
sults, unfortunately left a blank of at 
least 6 second between the removal 
of inspection figure and the presenta- 
tion of the test figure. Regarding 
time taken to produce aftereffects 
after the fixation of an inspection fig- 
ure and the time course of their de- 
cay, work such as that of Hammer 
(1949) has been performed but un- 
fortunately no neurophysiological data 
is available which would permit direct 
comparison. A comparison between 
such times in figural aftereffects and 
simultaneous and successive contrast 
would permit us to test the relation 
expected on the theory. 

It has been reported by Köhler and 
Wallach (1944) that outline figures 
have their maxima of effectiveness in 
displacing test contours at a greater 
distance than solid figures. Here it 
might be interesting to recall the situ- 
ation occurring in Limulus and its 
theoretical explanation by Hartline,
Ratliff, and Miller (1961). Aggre- 
gates of excited cells exercise an in- 
hibitory action on each other. A 
single ommatidium produces a higher 
frequency of discharge than when it is 
illuminated together with its neigh- 
bors. Similarly, the inhibitory effect 
of a whole group of excited receptors 
alters as its size is increased. The 
difference that would exist between 
line and solid figures is therefore not 
easy to predict without more precise 
information about contrast in the hu- 
man visual system. However, here 
again, there should be a congruence 
between behavioral data on contrast 
and figural aftereffects. 

However, it may here be objected 


cated. For 
(1961) reports that in the cat one 
half of the investigated cells fire when 
not stimulated and then exercise a 
laterally inhibitory action on a large 
surrounding field. The other half are 
more similar to what we would ex- 
pect from Limulus. These fire when 
they are illuminated and then inhibit 
a large surrounding field. Therefore, 
black lines could give rise to the type 
of lateral inhibition postulated here 
to account for figural aftereffects. 
Alternatively, the human system may 
be responding in terms of contours, 
rather than lines, as suggested by 
Köhler and Wallach (1944). 

It might further be objected that 
the test contours would introduce 
their own patterns of inhibition and 
excitation into the situation as it is 
illustrated in Figure 2, which might 
produce all kinds of unexpected effects 
through processes of summation. This 
is true, but these new contours would 
be adding the same patterns of inhi- 
bition and excitation to the pre-exist- 
ing pattern or baseline when there 
has been no previous prolonged in- 
spection of other contours. Such pat- 
terns are presumably discounted, or 
any distortions of distance there are, 
are the distances we normally see. In 
the typical figural aftereffect experi- 
ment we compare a “normal” dis- 
tance which has not been distorted by 
the introduction of previously 
contours, but which still may be dis- 
torted in some way by the points we 
use to demarcate it (e.g., Oppel’s
illusion; Teuber, 1960), with a distance
which has been distorted by our
previous experimental manipulations.
In other terms, the processes set up 
during the test are the same for the 
experimental and control stimuli and 
what the nervous system measures is 
the difference in the baselines for 
measuring distance on which both the 
control and the experimental stimuli 
ride. 

However, it is worth noting that 
the theory would predict not only 
some illusions due to the presence of 
contrasting points during steady
exposures, such as the filled-unfilled
space illusion (Oppel, 1855, quoted 
by Teuber, 1960), but phenomena of 
increases in distance with progressively 
longer times of exposure. Ratliff 
(1961) and Hartline, Ratliff, and 
Miller (1961) report that the lateral 
inhibitory action of a stimulus lags its 
excitatory effect. Such a relation may 
be clearly seen in Figure 1 in com- 
paring the consequence of stimulating 
one receptor. Its inhibitory action 
on another receptor (the first down- 
ward peak) is delayed in time in 
relation to the excitatory peak. We 
would expect, therefore, that when two 
points are flashed on, their distance 
should be underestimated and rapidly 
grow as the decrease in excitability 
in the space between them develops, 
due to an increasing lateral inhibition. 
This increase in apparent distance 
should then decrease with continued 
exposure. It is possible that some 
phenomena of apparent movement, 
such as the gamma phenomenon and 
underestimates of distance, when 
rapid rather than slower movement is 
induced, are due to this effect (Koffka, 
1935; Teuber, 1960). 

Further supporting evidence relat- 
ing figural aftereffects to changes in 
neural excitability is contained in
reductions of the threshold of pitch
discrimination at a particular frequency
after prolonged auditory stimulation
at that frequency (Deutsch,
1951). Decreases in spontaneous
activity in the auditory system following
prolonged auditory stimulation have
been found by Star? and Hilali and
Whitfield (1953). If, as it has been
suggested, discriminative acuity is re- 
lated to the noise level in a system, 
then a decrease in threshold following 
prolonged stimulation can be at- 
tributed to a decreased noise level 
due to the preceding prolonged stimu- 
lation. 

Earlier, we postponed a discussion 
of how increase or decrease in ex- 
citability could be translated by neural 
mechanism into distortions of distance 
between two points. Although the 
previously stated theory does not de- 
pend on our ability to state what such 
a mechanism is, the plausibility of the 
theory is increased if some suggestion 
can be made about the working of 
the postulated neural mechanism, so 
that what has been postulated can at 
least be seen to be possible and in 
principle relatively simple in its mode 
of operation. Two possible models 
will be suggested, but it should be 
stressed that the correctness of the 
hypothesis about figural aftereffects 
is independent of the speculations that 
follow. 

In the first model, let us suppose 
that the distance between two points 
on a projection surface is estimated 
by the time it takes a wave to travel 
from one point to the other. Let us 
further suppose that the more excitable 
a region over which the wave travels, 
the more quickly it will travel over 
that region. This could occur, for 
instance, as a result of decreased 
synaptic delay in a more excitable re- 
gion. Similarly, such a wave may be 
slowed down by a decrease in
excitability in a given region. On such
a system, if the region between two 
points were to become more excitable, 
then the time taken for the wave 
to travel between these points would 
diminish and so the apparent distance 
between the points would also di- 
minish; the contrary would happen if 
the region became less excitable. 
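
The first model can be illustrated with a few lines of Python. This is a minimal sketch under assumptions not made in the text: the surface is reduced to a row of points, local wave speed is taken to be directly proportional to local excitability, and the names `travel_time` and `apparent_distance` are hypothetical.

```python
def travel_time(excitability, dx=1.0, base_speed=1.0):
    """Time for a wave to cross a row of points, each dx wide.

    Assumption: local speed is proportional to local excitability, so a
    more excitable stretch is crossed faster.
    """
    return sum(dx / (base_speed * e) for e in excitability)

def apparent_distance(excitability, dx=1.0, base_speed=1.0):
    """Distance read off from travel time, calibrated at normal excitability (1.0)."""
    return travel_time(excitability, dx, base_speed) * base_speed

normal = [1.0] * 10      # ten points at normal excitability
enhanced = [1.25] * 10   # region more excitable than normal
depressed = [0.8] * 10   # region less excitable than normal

print(apparent_distance(normal))     # the real distance, 10 units
print(apparent_distance(enhanced))   # shorter than 10
print(apparent_distance(depressed))  # longer than 10
```

Because the estimate is a sum over the intervening points, a change confined to part of the path alters the total in the same direction, which is all the hypothesis about figural aftereffects requires.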

A second, entirely different system 
could give rise to the postulated re- 
lation between excitability and dis- 
tance. We may suppose that two 
points are projected on a succession 
of superimposed sheets or planes. We 
may also assume that each plane 
produces an output when two points 
below a certain distance apart from 
each other are projected on it, thus 
acting as a summation device or a 
coincidence gate. Each successive 
plane is more sensitive to points fur- 
ther apart than the last so that points 
a great distance apart from each other 
will produce an output only in the 
last few planes. In this way, the 
distance apart of any two points will 
be estimated by which planes are pro- 
ducing an output. (For further de- 
tails, see Deutsch, 1962.) 

We can assume that if a portion 
of these planes is rendered more ex- 
citable, the plane will produce an out- 
put for two points lying in this por- 
tion, which would normally be only 
just too far apart to produce an out- 
put on this plane. As an output from 
this plane usually indicates the pres- 
ence of two points which are closer, 
an underestimate of distance will be 
produced as a result of increased ex- 
citability between two points on such 
a plane. A similar argument can be
used to show how an overestimate of 
distance would result from a decrease 
of excitability between two points. 
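
The second model can be sketched in the same spirit. The fragment below is an illustration only: the number of planes, their limits, and the rule that excitability scales a plane's limit multiplicatively are assumptions introduced here, and `first_responding_plane` is a hypothetical name.

```python
def first_responding_plane(separation, limits, excitability=1.0):
    """Index of the first plane that produces an output for two points.

    limits[i] is the largest separation plane i accepts at normal
    excitability; raising excitability stretches that limit (an assumption
    made here for illustration).
    """
    for index, limit in enumerate(limits):
        if separation <= limit * excitability:
            return index
    return None  # too far apart for any plane

limits = [1.0, 2.0, 3.0, 4.0, 5.0]   # each successive plane tolerates wider pairs

sep = 2.2
print(first_responding_plane(sep, limits))                    # plane 2 normally
print(first_responding_plane(sep, limits, excitability=1.2))  # plane 1: read as closer
print(first_responding_plane(sep, limits, excitability=0.7))  # plane 3: read as farther
```

The distance estimate is simply which plane responds first, so a local rise in excitability moves the response to an earlier plane and a fall moves it to a later one.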
CONCLUSIONS


The present suggestion attempts to 
reduce figural aftereffects to the same 
underlying physiological cause as the 
phenomena of simultaneous and
successive contrast by making an
assumption about how distances are
computed by the nervous system. Since
lateral inhibition occurs at all levels 
in the nervous system, the hypothesis 
does not postulate an exclusively cen- 
tral or peripheral origin of figural 
aftereffects. Before the hypothesis 
can be accepted, further work is needed 
examining the relation of contrast phe- 
nomena in their temporal and spatial 
aspects to figural aftereffects on the 
behavioral level; and on the more 
physiological level, work on mam- 
malian nervous systems to investigate 
changes in activity at various dis- 
tances from an excited locus after pro- 
longed stimulation, when such stimu- 
lation has ceased. 
A general review of new methods, data, and recent interpretations of
memory is made. Especial emphasis is given to the observation of
1000 Ss reported here and 1000 Ss examined earlier. It is found that
motor and verbal methods of research have abruptly become more
similar and have parted with the Ebbinghaus tradition in many ways.
Rather than concentrating on between-subject phenomena,
between-materials phenomena, and between-trials phenomena, the present
approach relates the events within-trials and also relates the events
between 2 adjacent trials. Memory is described for 3 basic events:
(a) response repetition, (b) response alternative, and (c) informative
feedback. Changes in memory for these events are measured by
changes in their interdependencies and upon their dependence on the
prior events of Training. Many laws of retention are best specified
by using indexes of consistency based on variance and correlation.


By 1958 much research in verbal
learning gave the appearance of being
fixed upon short-term memory and/or
ancient dependent and independent
variables. This was to change rapidly
with Peterson and Peterson
(1959), Underwood and Postman
(1960), Sperling (1960), and Postman
(1961, 1962) who have found
limitations in old procedures, proposed
new variables, introduced major new
techniques, and altered our thinking
on portions of the forgetting curve for
verbal materials. An especially important
procedural innovation, introduced
by Peterson and Peterson
(1959), was the use of a single training
trial. Underwood (1957), in a
brilliant analysis of forgetting, asked
why the classical curve in verbal learning
fell so far and so fast. He found,
after making allowances for proaction, 
that verbal materials were very much 
less subject to forgetting than past 
data going back to Ebbinghaus (1885) 
had indicated. 

Bilodeau and Bilodeau (1961) re- 
viewed the contributions of the years 
1945-1959 to the area of forgetting 
of motor skills and showed once again 
that most studies of such forgetting 
failed to find it. They were unable 
to find any full-scale retention curve— 
a failure that, of course, made it im- 
possible to summarize the variables 
determining motor skills forgetting 
and their roles in producing more or 
less of it. By 1962 there were only 
portions of curves in verbal learning 
and still no curve to speak of in motor 
learning. The primary object of this 
review is to examine the writers’
full-scale, long-term retention curves and
to discuss the critical variables
controlling forgetting and failure of
forgetting of skills.
CRITICAL EVENTS OF MEMORY


Fig. 1. Identification of the two major
events of a trial of Training and the three
major events of a trial of Recall.


A Remedy 


In a monograph on motor and verbal 
memory, Bilodeau, Sulzer, and Levy 
(1962) have asked what primary 
events might be subject to forgetting. 
Their analysis centered on the analy- 
sis of components within a single trial 
of training and a single trial of mem- 
ory and was recommended for both 
verbal and motor applications. As a 
matter of fact, the verbal/motor 
dichotomy was not needed except to 
specify training materials. The criti- 
cal events of Training and Recall are 
shown in Figure 1 and are defined as 
follows. The R1 is the first and only
response of Training, the R* is the
alternative response to R1, and the KR
is the informative feedback pertaining
to R1 or the feedback pertaining to R*.
In Recall, R2, R*, and the recalled KR are the
subject’s attempts at recollecting the
earlier events. The R2 represents the
subject’s attempt to repeat R1; R2 is
not an attempt to improve upon R1.
Components KR and the recalled KR, such as
“You were low by 54 units,” “You 
told me I was high,” are more verbal 
than motor and emphasize the early 
involvement of verbal matters even 
at the roots of motor responding. 

The response terms R1, R2, and R*
may be either motor or verbal depending
upon the experiment the experimenter
chooses to do. The R1 and
KR are the two observable events of
Training. The R2, recalled KR, and R* are
the recallable events of the very next
trial. An important point to bear in
mind is that the responses and KR can be
made variable, continuously scaled
quantities as in a lever positioning
task. In the remainder of this paper
the events are treated as continuously 
variable even though they are not al- 
ways; i.e., when the subject repeats a 
training word or when the experi- 
menter says “correct.” The object of 
using scaled variables was to relate 
the individual events of the recall trial 
to those of the training trial and to 
each other more precisely. The litera- 
ture contains little about these relation- 
ships and, indeed, even discerning the 
individual events within trials is 
blurred. Certainly, when we do sepa- 
rate the within-trial events, it is clear 
that more than further investigations 
of phenomena between subjects, be- 
tween materials, and between trials, we 
require a series of studies relating the 
events within trials and the events 
between adjacent trials. It was 
thought that the task of relating could 
best be begun by examining con- 
sistencies and changing consistencies 
between the primary events. For ex- 
ample, it was soon found that with 
increasing retention intervals, forget- 
ting the events in a lever-positioning 
task showed in increasing variances, 
decreasing relationships (r?) between 
the events of Recall and Training, and 
increasing relationships (r?) between 
the events of Recall. 

The analysis of component events 
suggested by Figure 1 led to a great 
number of questions which could not
be answered without getting into many 
particulars. To take just one example: 
will the subject's long-term recall of 
Ra depend more upon (a) the recalled 
KR, or (b) the administered KR? The 
memory monograph contains the
complete formulation and considerable
amounts of data as well. The present 
study deals mainly with the effects 
of the retention interval, an interval 
ranging up to 6 weeks where no special 
interpolated activity is required of the 
subject and where normal daily activi- 
ties obtain. 

One of the effects of the previous 
study was to show the usefulness of 
variance and correlation in measuring 
retention. This is not at all surprising 
for it is well established that the vari- 
able error contains information which 
a measure of central tendency does 
not. And these measures are especially 
important when the amplitude of re- 
sponse represents the initial observa- 
tion. In the past, most analyses of 
memory could be reduced to the form: 
there is no difference between the 
mean frequency of items, mean time 
on target, etc., of Training and Re- 
call. Yet means can remain constant 
while major changes occur in the 
scores from one distribution to an- 
other. Individuals will begin to stray 
farther from the mean and to exchange 
position within the distribution.³
These changes in the data are often 
indications that retention is changing. 
Actually, there are seven ways to
describe potential changes in memory.


Measures of Memory 


Changes between two arrays of 
scores (Training versus Recall) can 
be measured by M, σ², and r. Either
one, two, or three of these statistics 
can change from Training to Recall 
in a total of seven combinations. Text- 
books in the psychology of learning 
ordinarily reduce these seven to one 
and perhaps in the long run to the
most trivial of the seven, i.e., the
average amount of material recalled is
less than the average previously
learned. Modern theories of learning
are silent on the question of the
environments that will produce one or
another of the seven dependent
possibilities. The writers do not know
either.


³ Lewis (1959) has made serious use of r
in studies of retroaction of the OL-IL-RL
design.

Figure 2 has been drawn to encour- 
age further questions about variance 
and correlational phenomena (and any 
other statistic of change) in memory 
research. In Part A, R1 represents
the distribution of the first response
of a large number of subjects. The
distribution R2 represents the subjects’
attempts to reproduce their previous
R1. The arrows show what might happen
to four hypothetical subjects. In
short, they do so badly at recalling
(R2) that the correlation between the
two arrays of scores (R1-R2) falls to
zero and reflects what is happening
within subjects from one trial to the
next. Yet, by comparing the two
distributions still further we see that
the means and variances hold constant.
The mean and variance have been com- 
pletely insensitive to the very substan- 
tial amounts of forgetting taking place. 
Part B of Figure 2 represents the 
case of change of variance, no change 
in mean, and no change in correlation. 
In this case of forgetting, the subjects 
can be said to be getting wilder than 
ever, yet Z scores and means remain 
the same from Training to Recall. 
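
The seven combinations and the two cases of Figure 2 can be reproduced with simulated scores. The sketch below uses only the Python standard library; the score distribution, sample size, seed, and the doubling factor in Part B are arbitrary assumptions chosen for illustration, not data from the study.

```python
import itertools
import random
import statistics

def pearson_r(x, y):
    """Product-moment correlation between two equally long score arrays."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

# The ways in which M, variance, and r can change from Training to Recall:
# any one, any two, or all three of the statistics.
stats = ("M", "variance", "r")
combos = [c for n in (1, 2, 3) for c in itertools.combinations(stats, n)]
print(len(combos))  # 7

rng = random.Random(0)
training = [rng.gauss(100, 10) for _ in range(2000)]

# Part A: the same scores reassigned at random among subjects.
# Mean and variance are untouched; the correlation falls toward zero.
recall_a = training[:]
rng.shuffle(recall_a)

# Part B: every deviation from the mean is doubled.
# Variance grows, but the mean and each subject's Z score are unchanged.
m = statistics.fmean(training)
recall_b = [m + 2 * (x - m) for x in training]

for label, recall in (("A", recall_a), ("B", recall_b)):
    print(label,
          round(statistics.fmean(recall), 2),
          round(statistics.pvariance(recall), 2),
          round(pearson_r(training, recall), 2))
```

A comparison of means alone would report no forgetting in either case, which is the point the figure is drawn to make.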

In the experiment reported below 
we look for evidence on the form of 
the retention curves for R2, the recalled KR, and
R* as measured by variance and
correlation under conditions of simple
training and recall environments. The 
aim is to consider memory as an as- 
pect of response consistency and to 
consider the merits of the hypothesis 
that forgetting is well expressed in 
terms of response inconsistency. 


Fig. 2. In Part A, R2 represents a
distribution of responses where the subjects
attempt to repeat their prior responses (R1).
Here, forgetting has led to a correlation of
zero, but mean and variance have held
constant. In Part B, forgetting has led to
an increase in variance, but mean and
correlation have held constant.


METHOD 


Subjects and apparatus. Over 1000 subjects
were trained and 946 of these were
tested for recall. Half were scheduled
from college sources and half from Keesler
AFB. However, since absences were
greater in the colleges, the colleges
ultimately yielded 380 subjects and Keesler
produced the remainder.⁴ The proportion
of college subjects from group to group
was fairly constant. Furthermore, the
college and airmen subjects have behaved
alike during the present series of studies
and mixing these subjects is now routine.


⁴ The permission of the Commander of
Keesler Air Force Base and the assistance
of Major Marvin Karp are gratefully
acknowledged.

Civilian subjects were obtained from
Louisiana State University, Baton Rouge;
Louisiana State University, New Orleans;
and Tulane University, New Orleans.

The apparatus was Manual Lever D, a 
box housing a movable lever (Sulzer, 1963). 
The subject's task is to stand in front of 
the box, grasp the unseen lever, and move 
it from left to right on the command of 
“move.” Two targets, one corresponding 
to R1 and one to R*, were employed.
Target 100 was 20° beyond the response 
origin and 64° above the horizontal. Target 
300 was 60° beyond the origin. Responses 
and Targets may be converted into degrees 
of arc by dividing by 5.0. 

Procedure. During Pretraining, the sub- 
ject moved the lever (twice) until it 
stopped automatically at one, then the other, 
of the two targets. Thereafter, the experi- 
menter removed the automatic stops and 
the subject had to stop the lever without 
any mechanical assistance. During Train- 
ing, the subject was required to guess 
which of the two targets was “correct” 
and to indicate his selection by executing 
a move (R1) blindly and without the aid
of the stop. His KR was administered
immediately after R1. The procedures used
were almost exactly those described for
Group I-28 in the earlier study.⁵ Recall
followed the R1 and KR events by 3
minutes, 20 minutes, 2 days, or 6 weeks. 
These retention intervals will be used to 
designate the four groups. 

The groups were subdivided according to 
the amplitude of R1 and the KR
administered as follows:


1. 100 Shooters: subjects for whom R1 < 200. This defines their alternative
target as 300 and their KR as Low or Correct.

(a) Low: “You chose the wrong target
and did not move far enough by ____
units.”

(b) Correct: “You chose the correct
target.”


2. 300 Shooters: subjects for whom R1 > 200. This defines their alternative
target as 100 and their KR as High or Correct.

(a) High: “You chose the wrong target
and moved too far by ____ units.”

(b) Correct: “You chose the correct
target.”


⁵ The experiments are highly related and
were designed so that their data can be
compared. This also means that the
elements of minor detail in common need not
be rewritten here. The most important
departure from the previous study was the
omission of the probes for R3 and R*3 (the
subject was not asked to repeat the R2 and the
R* he had produced a few seconds before).
It had been shown that the reliability of the
present task is very high, for the
correlations between responses separated by a few
seconds are over .90. There was no need
to show this again and valuable time was
saved by dropping the two probes. The
effect upon the subject was probably to
make recall a trifle easier in that R3 or R*3
constitutes an extra interpolated task.


It might be helpful to summarize the
task by following a hypothetical subject.
For example, if R1 equals 115, the
subject is a 100 Shooter by Definition 1
above and has overshot the target of his
choice by 15 units. Further, for KR the
experimenter might say, “You chose the
wrong target and did not move far enough
by ____ units” (1a above), and dismiss the
subject. After 6 weeks, in response to
the experimenter’s request to repeat R1,
the subject might move 130 units instead
of 115; in response to the experimenter’s
request for R*, the subject might move
310 units instead of 300; in response to the
experimenter’s question about the previous
KR, the subject might answer, “You said I
did not move far enough by about 50
units.” Now that the procedure has been
made so specific it would be well to relate
it to Figure 1. The R1s, R2s, and R*s are
simple positioning responses; KR and the
recalled KR are verbal events; a request for
R2 (or R*) illustrates a selective learning
component in that the subject must choose
between classes R1 and R*. The foregoing
analysis makes it impossible to describe the
present task as motor, verbal, or
verbal-motor. Instead, it is all of these.

A unique use of KR prevailed, and the
logic of it is described fully in the
monograph. Nevertheless, to reduce a
potential source of confusion, a brief
overview of the KR procedure seems in
order. The experimenter’s choice of values
for KR was arbitrary in order to make KR
independent of R1. This makes it possible
to relate the events of Recall to two
independent events of Training. The
subjects were prescheduled to be Correct or
to be given KR of a [...] magnitude if
Incorrect. The magnitude of R1 served to
determine if the subject were to be Low or
High, [...] sampling rectilinearly from the
[...] 324. Thus, as both the number
and the unit were arbitrary, it was expected
that such KR would harm the recall of
R1 and R^a as contrasted with the customary
expectation of improving responses through
feedback.

The subject's activities during the reten- 
tion intervals must be made explicit, 
these activities may produce some of 
failures of memory to 
cautions were taken to prevent the 
from realizing that he was being 
that the experimenter would return 
his recollection of R. R', and KR. 
subjects of Groups 2 Day and 6 
were dismissed after Training and the 
perimenter returned to them in 
room at the time for Recall. 
however, so little time between T 
and Recall for Group 3 Minute 
of the subjects was sent back 
was also the practice for Group 
except for the college subjects, 
necessity had to be returned 
room lectures during their 
cannot be said that the four 
groups have had identical 
ing their intervals. It was and 
technically impossible to arrange 
neous retention environments for 
groups and we are not convinced 
was an important shortcoming. 
never be possible to have the 
6-week interval qualitatively 
tatively the same as for one of 3 
This is a type of limitation, 
serious one in view of the main objecti 
ject broke the position and posture assumed
for the training trial; even 

subject was dismissed from 
training environment (for 2 minutes 
TABLE 1

NUMBER OF SUBJECTS REVERSING TARGETS AND TOTAL SUBJECTS

                                     Group
R1             KR        3 Minute   20 Minute   2 Day   6 Week
100 Shooters   Low       3/73       0/70        2/65    9/52
               Correct   1/59       1/50        0/49    5/44
300 Shooters   High      2/67       5/63        6/73    9/65
               Correct   2/53       1/58        2/58    8/47


The distribution of the 946 subjects
is shown as the lower entry in the cells
of Table 1. The upper entries show
the numbers of subjects who selected 
the wrong target or who reversed the 
two target places during Recall. For 
example, after 6 weeks, 9 of 52 Low
subjects responded with R^a when asked
for R2 and responded with R2 when
asked for R^a. Reversal is a rather
infrequent type of forgetting, but es- 
pecially so at 3 and 20 minutes. Even 
though the relative frequencies in 
Table 1 never exceed one in five, 
the reversing subjects have a powerful 
effect upon the statistics which follow. 
Reversing subjects fall outside the 
main distribution and contribute more 


[Figure 3. Two scatter-plot panels; abscissa:
magnitude of first response. The 6-week
recall panel is marked r = .20.]

FIG. 3. Scatter plots for 100 Shooters
showing the deterioration of R2 with
respect to R1 after 20 minutes and after 6
weeks.


than their share to M, σ², and r.
Unfortunately, the worst hob is raised at
a critical spot, among the Low of the
3-Minute 100 Shooters, where an upward
fluctuation produced three reversing
subjects, and in Group 20 Minute 100
Shooters Low, where a downward
fluctuation produced no reversing
subjects. As we shall see, the overall
effect of these two fluctuations was to
produce inversions of rank order between
Groups 3 Minute and 20 Minute and to give
the appearance of better recall with the
longer interval in the 100-Shooter analyses
which follow.
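
The reversal counts of Table 1 can be checked with a little arithmetic. The following is a minimal sketch, not part of the original analysis: it recomputes the relative frequency of reversal in each cell and the total number of subjects. The dictionary layout and the names are illustrative assumptions, and the first cell is read as 3/73, in keeping with the three reversing subjects mentioned in the text.

```python
# Reversal counts transcribed from Table 1 as
# (reversing subjects, total subjects) per retention group.
# The layout and names below are illustrative assumptions.
GROUPS = ["3 Minute", "20 Minute", "2 Day", "6 Week"]
TABLE_1 = {
    ("100 Shooters", "Low"):     [(3, 73), (0, 70), (2, 65), (9, 52)],
    ("100 Shooters", "Correct"): [(1, 59), (1, 50), (0, 49), (5, 44)],
    ("300 Shooters", "High"):    [(2, 67), (5, 63), (6, 73), (9, 65)],
    ("300 Shooters", "Correct"): [(2, 53), (1, 58), (2, 58), (8, 47)],
}


def reversal_rates(table):
    """Return {(row, group): proportion of subjects reversing targets}."""
    rates = {}
    for row, cells in table.items():
        for group, (reversed_n, total_n) in zip(GROUPS, cells):
            rates[(row, group)] = reversed_n / total_n
    return rates


rates = reversal_rates(TABLE_1)
total_subjects = sum(n for cells in TABLE_1.values() for _, n in cells)
max_rate = max(rates.values())

print(f"total subjects: {total_subjects}")
print(f"largest reversal rate: {max_rate:.3f}")
```

Under this reading the cell totals sum to the 946 subjects reported, and no cell exceeds the one-in-five bound stated above.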

Figure 3 was prepared as a scatter 
plot to give the reader a better visuali- 
zation of the forgetting analyses to 
follow. R2 is plotted against R1
for Group 20 Minute on the left side
and Group 6 Week on the right side.
Group 6 Week shows a greater variance
in R2 and a lesser correlation for
R1-R2 than Group 20 Minute. The
inconsistency of recall in Group 6 
Week is further heightened by the fact 
that 9 of 11 subjects with R2 > 200
executed R2 > R^a.

The data of the earlier study made 
it clear that Target 300 presents an 
easier task than does Target 100. It 
is for this reason that the analyses 
below emphasize Target 100 over 300. 
Effects of Forgetting upon the Means


The means for the 100 and 300
Shooters are given in Tables 2 and
3, respectively. The 100 Shooters
show the inversion between the first
two groups mentioned before, and
then the means of R2 deteriorate
progressively (toward 300) with
intervals of 20 minutes, 2 days, and 6
weeks. The mean of 177 for Group
6 Week is very much the same as
obtained in the earlier study with a
4-week interval, but the mean of 159
for Group 3 Minute is much higher
than expected. The R2s at the bottom
of Table 2 are for the Correct
subjects and are more in line with
expectations. Furthermore, in all four
cases the Correct subjects have
outperformed the Low subjects (p < .05,
df = 1/454) in the attempt to repeat
the original response.

The R^as for the 100 Shooters show
little forgetting taking place at Target
300. The greatest evidence for
forgetting is the 292 of Group 6 Week
Low, which, though quite an accurate
R^a, is a deterioration from a 330 level
which would have resulted if the test
had been scheduled earlier. There is
a significant difference at the .05 level
between the Low and the Correct
subjects at 6 weeks (t = 2.07, df = 94).


TABLE 2
MEANS FOR THE 100-TARGET SHOOTERS


TABLE 3
MEANS FOR THE 300-TARGET SHOOTERS

Examination of the means of the
300 Shooters in Table 3 will show no
sign of forgetting in R2 and strong
indications of forgetting in R^a. The
Correct subjects perform no worse at
R^a than the High, and the F between
feedbacks is less than unity (df =
1/476). At 6 weeks neither the High
nor Correct performs well, for with
scores of 181 and 196 they are
approaching a point midway between
targets. These values do not differ
from one another by a t test.

The Correct subjects do exception- 
ally well at recalling that they were 
told “correct.” Of 202 correct subjects 
moving toward the 100 Target, 2 did 
not recall their KR (both from Group 
6 Week). The Low subjects have 
more to forget, it being possible to 
forget that they were wrong, low and 
x units low. Of the 260 subjects in 


[Figure 4. Variance of the repeated response
for the 100 Shooters, with separate curves
for Low and Correct; abscissa: retention
time in log spacing.]

FIG. 4. An illustration of increasing
variance in a repeated response as a function
of increasing value of retention time. (The
state of KR serves as a parameter.)


the Low category, 247 recalled some 
number and some sign. Eight of the 
13 subjects who did not were in the 
6-Week group and responded (a) 
with nothing, (b) “I was wrong,” 
or (c) “I was —— off.” The data 
of the 300 Shooters were very similar.
The category of subjects recalling
some number and some sign is
represented as KR_r in Tables 2 and 3.
All four groups show the tendency
of recalling a figure smaller than the
one administered. The mean for
Group 6 Week is −31 and is smaller
in absolute terms than the −183
administered. However, 15 of the 44
subjects are mistaken about the sign
of KR and 5 of these confuse R2 and
R^a as well. Their influence on the
means of Table 2 is great.


Effects of Forgetting upon the 
Variances 


The variance of the recalled event 
serves as the dependent variable and 
the length of the retention interval 
serves as the independent variable.*
The four intervals were converted to 
minutes and then to logarithms. 
Otherwise, it would have been tech- 
nically impossible to express the dis- 
tances between the various rests (rang- 
ing from 3 to over 60,000 minutes) 
on graph paper. The log scale, more- 
over, does no violence to the shape 
of the curves to be reported.” The 
log scale has the further virtue of 
grouping the first two points and 
grouping the last two points. This 
makes it easier to determine whether 
the curves are negatively or positively 
accelerated inasmuch as two points of 
inquiry are placed at both ends of 
the scale. 
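
The conversion described above can be sketched in a few lines. This is only an illustration: the base of the logarithm is not stated in the text, so base 10 is assumed here, and the names are hypothetical.

```python
import math

# Retention intervals of the four groups, expressed in minutes.
INTERVALS_MIN = {
    "3 Minute": 3,
    "20 Minute": 20,
    "2 Day": 2 * 24 * 60,
    "6 Week": 6 * 7 * 24 * 60,
}


def log_spacing(intervals):
    """Map each interval (in minutes) to its common logarithm.

    Base 10 is an assumption; the text does not name the base.
    """
    return {name: math.log10(minutes) for name, minutes in intervals.items()}


log_positions = log_spacing(INTERVALS_MIN)
ordered = list(log_positions.values())
# Distances between neighbouring points on the log scale.
gaps = [later - earlier for earlier, later in zip(ordered, ordered[1:])]

for name, position in log_positions.items():
    print(f"{name:>9}: {INTERVALS_MIN[name]:>6} min -> log10 = {position:.2f}")
print("gaps between neighbouring points:", [round(g, 2) for g in gaps])
```

On this scale the largest gap falls between 20 minutes and 2 days, which is the sense in which the first two points and the last two points are grouped.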

Figure 4 shows the variances for
R2 of the 100 Shooters. The curve
with the lesser variances was
produced by the subjects told that R1
was Correct. The curve for Correct
is clearly positively accelerated while
the acceleration for the Low is not so
clear because of the (unreliable)
location of the first point. The writers
would have been pleased to have the
three subjects who responded as if
R2 were R^a somewhere else, e.g., in a
void pile. Making an allowance for
the three subjects would produce a
curve resembling that for Correct. A
subsequent experiment confirmed the
hypothesis that the shape for Low
resembles the one for Correct.

The R2 variances for Target 300
are not presented because recalling 


® Tables of variances and correlations for 
each of the groups have been deposited with 
the American Documentation Institute. 
Order Document No. 7620 from ADI Aux- 
iliary Publications Project, Photoduplication 
Service, Library of Congress, Washington 
25, D. C. Remit in advance $1.25 for micro- 
film or $1.25 for photocopies, and make
checks payable to: Chief, Photoduplication 
Service, Library of Congress. 

No transformation of the abscissa can 
alter the fact that the first two or three 
ordinal values remain constant. 
FIG. 5. Four illustrations of increasing variance in the recalled response as functions of
the conditions of feedback and responses required.


this response, as in earlier studies, is
so easy. The variances for the first
three groups were quite the same and
the ratios of R2/R1 were close to
unity. At 6 weeks, however,
forgetting was substantial and the variance
for R2 was nearly four times as large
as for R1. Further, the Highs and the
Corrects were very much the same
at each interval.
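
The two indices used in this section, the variance ratio R2/R1 and the R1-R2 correlation, can be illustrated with a short sketch. The amplitudes below are hypothetical and are not data from the study; the function names are likewise assumptions. The example is chosen to show that the mean and the variance may stay put while the correlation collapses.

```python
from statistics import mean, variance


def pearson_r(x, y):
    """Product-moment correlation between two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def forgetting_indices(first, recalled):
    """Summarize change between a first response and its attempted repeat."""
    return {
        "mean_shift": mean(recalled) - mean(first),
        "variance_ratio": variance(recalled) / variance(first),
        "correlation": pearson_r(first, recalled),
    }


# Hypothetical amplitudes (arbitrary units), one value per subject.
r1 = [95, 102, 110, 115, 120, 128]
r2_stable = [a + 5 for a in r1]               # every subject drifts by 5 units
r2_scrambled = [120, 95, 128, 102, 115, 110]  # same values, different subjects

stable = forgetting_indices(r1, r2_stable)
scrambled = forgetting_indices(r1, r2_scrambled)

for label, result in (("stable", stable), ("scrambled", scrambled)):
    print(label, {key: round(value, 3) for key, value in result.items()})
```

In the scrambled case the group mean and variance are unchanged, yet individual subjects no longer reproduce their own first responses, which only the correlation reveals.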

The variances of R^a in the top two
graphs of Figure 5 are for 100 Shooters
and 300 Shooters. Note the strong
tendency for positive acceleration in
three of the four curves. Further,
there is no substantial difference
between the Low and Correct and
between the High and Correct. However,
there is a strong tendency for
the variances of R^a to swell more in
the 300 Shooters than in the 100
Shooters because they have the harder
task to remember. The reader should
recall that Target 100 produces a
slightly smaller R1 variance than does
Target 300.

The pair of figures at the bottom
of Figure 5 compares the recall of
R2 and R^a with target place held
constant at Target 100. The one on
the left is for KR Incorrect and the
one on the right is for KR Correct.
The results make good sense in that
R^a is more poorly recalled than R2
FIG. 6. Illustrations of decreasing
correlations with increasing value of retention
time for the 100 Shooters.


and the advantage is slightly to the 
subjects called Correct. The graph 
on the left confounds the two vari- 
ables of feedback and the response 
factor whereas there is no such con- 
founding in the graph on the right 
(all subjects were Correct). The two 
curves on the right, by their place, 
interaction, and acceleration, best rep- 
resent the predicted outcomes of the 
experiment. 

Figures for Target 300 are not
presented. There was no difference
in the recall of R2 and R^a for the
first three retention intervals. At 6
weeks, however, R^a deteriorated
markedly for both the Low and the Correct
comparisons. The deterioration was
worse for Low than for Correct
because of the stimulus properties of
KR.

Lastly, it should be stressed that in 
the preceding analyses of means and 
variances, the data showing the greater 
forgetting have been made into 
graphs, whereas the data with lesser 
forgetting have been summarized in 
words. All the data are consistent 
with the statement that little or no 
forgetting takes place between 3
minutes and 2 days of retention. There
appears to be some forgetting before
3 minutes and a great deal more after
the 2-day interval.


Effects of Forgetting upon the 
Correlations 


The tables of intercorrelations
between events have been deposited with
ADI. The extra important correlations
are R1-R2 and KR-KR_r, and
these have been plotted in Figures 6
and 7 as a function of the retention
interval. Low, Correct, and High
subgroups are represented as parameters.
Since all six curves tend to have
negative slope it is safe to say that
the correlation falls away with
increasing intertrial interval. Too, the
greatest drops in correlation occur between
2 days and 6 weeks as did the greatest
changes in Ms and σs. Inspection
of all six curves produces a
strong impression that the acceleration
is negative (often called positive
in psychology), and by this assumption,
the deviations about the idealized
FIG. 7. Illustrations of decreasing
correlations with increasing value of retention
time for the 300 Shooters.

ously unreliable and so comparisons
between pairs of curves are made at a 
considerable risk. We prefer not to 
compare them. Because of inversions 
and other considerations the writers 
find insufficient evidence to establish 
a difference between Correct and Low 
and between Correct and High. There 
is evidence in the 100 Shooters that
the inconsistency within the
distribution of KR_r is greater than that for
R2 but this is not fundamental as it
depends on the memorability of the
particular KR numbers used.
Moreover, the KR-KR_r curve is already
purged of certain subjects who fail to
recall KR. When comparing the 100
Shooters against the 300 Shooters the
most noticeable effect is the greater
fluctuation in the r’s of the 100
Shooters. It is hard to decide whether this
is a property of the target place per
se or whether it is attributable to the
fact that some subjects find it difficult
to recall R2 at Target 100. The
preceding difficulties, however, do not
bear on the conclusion that the three
curves suggest a negative acceleration.


Discussion 


A search of the literature on
memory for motor skills by Bilodeau and
Bilodeau (1961) found no full-scale
forgetting curve worthy of the name.
They also concluded that interference
has been difficult to produce when
material is deliberately interpolated
between original learning and
relearning. A great deal of forgetting,
however, was shown in the data of the
present series. Why forgetting was so
strong here and so weak before is a
question that brings up several
systematic issues and important leads.


1. The present studies use a finely 
grained measure of response—ampli- 
tude in degrees of arc. Oftentimes, 
grosser measures such as time-on-target
and response frequency allow
diverse ways of scoring and it cannot 
be said with assurance that the skill is 
retained when the score remains in- 
variant. It is important to note that 
the kind of motor skill that is so 
famously resistant to forgetting is not 
a rote-learned sequence of invariant 
responses. 

2. Variances and correlations were 
examined as well as the mean. For 
some applications, the statistics σ²
and r are more sensitive to incon- 
sistencies of responding. Lewis 
(1959) has used correlation in studies 
of the effect of interpolated learning 
upon retention, but few others have 
had adequate size of sample or the 
inclination. Bilodeau et al. (1962) 
have successfully employed and de- 
scribed what happens to the means as 
forgetting of the present task takes 
place, but they have also urged the 
wider use of indices of change within 
individuals. In motor research, error 
or magnitude of response is observed 
as a matter of course, and variance 
and correlation statistics naturally fol- 
low.“ 

3. The amount of training and ma- 
terial were both severely limited. Dur- 
ing Training Ri was executed only 
once and the experimenter did not 
ask for Ra. The KR administered 
was a single stimulus event to which 
the subject made no observable re- 
sponse during Training. Training 
was further minimized because the 
subject was not told that he was be- 
ing trained nor that he would later 
be asked for his recollection of any- 
thing. The pretraining and training 


s Criticism of our urging wider examina- 
tion of changes of the subject’s behavior be- 
tween Training and Recall has taken the 
form that many tasks are not amenable to 
variance or correlational analyses as sug- 
gested in this paper. The critics are correct; 
many tasks are not. 


38 EDWARD A. BILODEAU AND C. MICHAEL LEVY


given were adequate to produce high 
levels of intertrial consistency for pe- 
riods up to a day or so, but were in- 
sufficient to last 6 weeks. 


Reviewing the years of research on 
rote learning of verbal materials, Un- 
derwood (1957) stated that future ex- 
periments should stress the learning 
of a single association and the meas- 
urement of its decline in strength over 
time. By this he meant one training 
trial on one response item. The pres- 
ent study stresses single presentation 
too, but it uses three primary events. 
Even simpler experiments than the 
one reported here are possible. Future 
designs could eliminate R^a and KR
and employ only R1, R2. On the
other hand, the three events could be 
retained, and the source and level of 
recall controlled within the individual 
by reminding the subject of one or 
two events before asking for the recall 
of the other.? The retrieval of an event 
should depend on the state of recall of 
another (or the presence of another) 
and so the important question becomes 
the degree to which they are associ- 
ated either through pre-experimental 
means or through training. 

Peterson and Peterson (1959) have
measured the short-term retention of
individual items (R1, R2) in a very
original study. There remains,
however, the necessity of determining the
retention of R^a and KR in a more
verbal context than levering. The
analysis of KR, in particular, has been
consistently overlooked in verbal
learning because KR is often confounded
with the response term. In lists of
paired associates, for example, x — y is
exposed after the stimulus term x, y


® Work on long-term retention under con- 
ditions of artificially induced recall has been 
carried out by the writers (Bilodeau, Levy, 
& Sulzer, 1963). Stimulated recall bears 
on modified free recall, whereas the present 
paper deals with free recall. 


being ordinarily considered the
response term. Indeed, though it has
been noted before that y serves as a
confirmation as well, its KR function
has in fact been neglected. A simple
analogy to the present experiment in
a verbal context may make the point
that verbal and motor retention can
in fact be studied at the same level.
For example, a verbal response might
be the word selected from a number
of alternatives, as in multiple choice.
If the subject were shown Items X,
Y, Z simultaneously, and chose Z,
we could observe the forgetting of the
R^as (X and Y) and the KR (correct
or incorrect), and their
interdependencies. For further analysis of verbal
experiments in terms of R1, R^a, KR
see Bilodeau et al. (1962).

4. Most previous motor skills stud- 
ies have used tracking tasks where 
both stimulus and response are con- 
tinuous and complex. Thus, we are 
in the position of obtaining more for- 
getting with a simple task and little 
training than with a complex task and 
considerable training. The complex 
task situations seem to have produced 
high intratask transfer. Item 5 below 
bears on this perplexity. 

5. A long-standing enigma is that 
verbal skills are rapidly forgotten 
while motor skills are exceedingly 
well retained. Much of this problem 
has now disappeared because (a) Un- 
derwood (1957; see also Keppel & 
Underwood, 1962) showed that for- 
getting of verbal material is not neces- 
sarily rapid and great, and (b) the 
present studies show that the forget- 
ting of a positioning response, such as 
R1 or R^a, can be great. Recent years
have brought changes in our
expectation of the shape of the retention curve
for both motor and verbal material.
The classical (Ebbinghaus) curve has
lost its pre-eminent position. Other
techniques are producing curves with
other shapes. Even the immediate
memory span has been reinvestigated, 
and this has led to forgetting func- 
tions within the “span” (Sperling, 
1960). To claim a comprehensive 
theory of human retention would be 
premature. But it is not overly opti- 
mistic to say that recent treatments 
of both verbal and motor retention 
represent advances toward an account 
of human retention and its controlling 
variables. The cheerfulness of the 
present outlook depends on the obser- 
vation that both verbal and motor 
forgetting can, with proper control, 
deviate from their stereotypes. It has 
become possible to talk about both 
in the same article, reversing the trend 
of several decades. 

Reanalyzing classical pre-World 
War II data, Underwood reported that 
85% or more of the items are gen- 
erally retained over 24 hours, and 
says, “In short, we must start all over 
and derive a retention curve over time 
when the subjects have learned no 
previous materials in the laboratory. 
It is apparent that I expect the fall in 
this curve over time to be relatively 
small.” The shape of the curves for 
levering is well in line with Under- 
wood’s expectation for we did find 
little evidence of forgetting between 
3 minutes and 2 days of retention. 

6. The writers suspect the existence 
of a two-stage curve of forgetting for 
positioning tasks of the lever variety. 
The subjects of all four of our groups 
disengaged themselves from the lever 
and training environment between 
Training and Recall. That is, by re- 
moving themselves from the training 
situation, all subjects did such things 
as: released the grip of the lever, 
moved the resting arm away from the 
top of the leverbox, changed foot po- 
sition, moved out of the room, moved 
back, etc. All of these events
occurred between R1 and R2 and the
highest R1-R2 correlation found was
in the vicinity of .70. However, in the
study by Bilodeau et al. (1962), some
subjects were left engaged between
successive responses and only a few
seconds intervened between R1 and R2.
These R1-R2 correlations were always
close to .95. Therefore, it seems that
the act of moving away from the lever
after R1 drops the correlation from
.95 to .70 (and nearly doubles the
variance) and suggests why retention
was less than perfect at the 3-minute
point. Further inquiry is certainly
required, but we believe there might
be two decay stages to a full-scale
forgetting function: a brief and shallow,
but very rapid, first stage followed
by a relatively long and accelerating
second stage as shown in Figure 8.

FIG. 8. Hypothesized two-stage forgetting
curve. (Abscissa: retention time.)

The evidence for the hypothesized sec- 
ond stage (solid line) is to be found 
in the section on Results; the first 
stage (dashed portion at left) has yet 
to be studied in detail with the lever. 
A virtue of Figure 8 is that the Y
intercept is fixed at 1.00. Here,
recall is perfect in the sense that the
variance ratio of R2/R1 is unity or
the correlation for R1-R2 is unity.
Disengaging the subject could
produce (a) responses which interfere
with the recall of R1 or (b) changes
in postural and environmental cues
used to reproduce R1. Changes in
stimuli and responses associated with 
disengagement from the training en- 
vironment, we guess, take rather sud- 
den effect and then no more. Actually, 
the first stage is best discussed as a 
manifestation of the warm-up phe- 
nomenon or as a loss of set. Adams 
(1961) has an extensive review of the 
evidence that warm-up decrement is 
one of the two portions of retention 
loss. 

The second stage represents a 
grosser type of forgetting—the one 
studied in this series. We need to 
know more about the explanation of 
the second stage and more about the 
description of the first stage. We 
guess that the first stage is completed 
within minutes or less, that the near- 
plateau period lasts for a day or two, 
and that the greatest amount of for- 
getting occurs after a few days and 
before several weeks. Some past stud- 
ies of motor forgetting may have 
placed too many of their empirical 
points between the two stages of de- 
cay and thus found no forgetting. 

The acceleration of the second stage 
is novel, since the Ebbinghaus curve 
shows the opposite sign. Data sup- 
porting the Ebbinghaus function are 
ordinarily explained by referring to 
exponential-like decay processes—a 
popular explanation because so many
kinds of data can be fitted by the 
exponential that it might almost seem 
the only natural process. Using the 
idea of reliability, however, provides 
an attractive way of rationalizing the 
unusual acceleration of the second 
stage of Figure 8. Given that the 
subject can repeat a response from 
one moment to the next, he should 
also be able to repeat nearly as well if 
the attempted repetition is delayed by 
an extra moment. Said another way, 
if the probability of repeating a
response exactly over interval t is very
high, there is no reason to assume
that something unusual will happen
over t + k (where t and k are of the
same order of magnitude). By 
analogy, assume that a car is started 
once a day and that there are no fail- 
ures at starting. What is the proba- 
bility of failure with the interpolation 
of an extra day? Somewhat greater 
than before to be sure, but the writers 
doubt that the greatest increment in 
probability occurs in the first (of 
many) extra day of layover. Only a 
trifle of updating of pro- and retro- 
action concepts is required to explain 
the delay of forgetting shown in the 
second stage of Figure 8. The up- 
dating is based on the general idea 
that the probability of the occurrence 
of interfering events should rise in 
keeping with the expectation of events 
randomly distributed in time and that 
the growth of their effect is ogival. 

7. Last but not least, it seems rea- 
sonable to suppose that using two 
targets was responsible for some of 
the forgetting shown here. It was the 
use of a second Pretraining target 
which established R^a. We have seen
R^a produced when the experimenter
asked for R2, and we have seen the
difference between the means for R2
and R^a approach zero with increasing
retention intervals. The two target 
habits appear to interfere with each 
other and to do so increasingly with 
time. In the end there could be com- 
plete forgetting of class membership, 
an issue analogous to list member- 
ship (first or second) as discussed in 
verbal learning (Barnes & Under- 
wood, 1959; Postman, 1961). 

Only two targets have been used 
with the lever, though more could 
have been used; since one of our 
concerns is to produce motor
forgetting, new studies should include more
R^as. The work of Reid, Lloyd,
Brackett, and Hawkins (1961) and of
Murdock (1961), who varied the
number of verbal items constituting the
stimulus, is relevant. In the levering
series, there were two types of loads:
(a) one, two, or three basic events
(R1, R^a, KR) and (b) number of
items within events (held at unity). 
Variations of load will quickly move 
the inquiry to considerations of pro- 
action because learning and memory 
should succumb with increasing load. 
And memory should also tend to fail 
as a function of responding qua re- 
sponding (retrieving) during Recall. 
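
The ogival growth of interference proposed under Item 6 can be given one concrete reading, offered here only as a sketch under stated assumptions. The paper supplies no formula; the code assumes that interfering events arrive at random at a constant rate (a Poisson process) and that forgetting requires several such events to accumulate. The rate, the thresholds, and the function names are all hypothetical.

```python
import math


def prob_at_least(k, rate, t):
    """P(at least k events by time t) for a Poisson process with the given rate."""
    expected = rate * t
    below_k = sum(
        math.exp(-expected) * expected ** i / math.factorial(i)
        for i in range(k)
    )
    return 1.0 - below_k


def daily_increments(k, rate, days):
    """Increase in that probability contributed by each successive day."""
    probs = [prob_at_least(k, rate, day) for day in range(days + 1)]
    return [later - earlier for earlier, later in zip(probs, probs[1:])]


RATE = 0.2  # hypothetical: interfering events per day

# One event suffices: exponential-like growth, largest gain on the first day.
single = daily_increments(k=1, rate=RATE, days=30)
# Four events must accumulate: an ogive, with the largest gain arriving later.
several = daily_increments(k=4, rate=RATE, days=30)

print("day of largest gain, one event needed:  ", single.index(max(single)) + 1)
print("day of largest gain, four events needed:", several.index(max(several)) + 1)
```

When a single event suffices, the first extra day contributes the largest increment, as in the exponential account. When several must accumulate, the largest increment is delayed, which is the pattern the car analogy describes.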


SUMMARY 


Motor and verbal methods of re- 
search, forgetting phenomena, and ex- 
planations have abruptly become more 
similar, reversing a long apartheid- 
like trend. It seems desirable to ref- 
erence motor and verbal research in 
the same article, perhaps because 
both approaches speak to human mem- 
ory. The closer ties have also meant 
substantial departures from the school 
of Ebbinghaus. 
SOME METHODOLOGICAL CONSIDERATIONS IN
MULTIPLE-CUE PROBABILITY STUDIES¹


CAROLYN J. HURSCH, KENNETH R. HAMMOND, AND
JACK L. HURSCH²


University of Colorado


Our principal concern is to make plain that in multiple-cue probability
studies carried out within the framework of multiple regression analysis,
the statistical properties of the environment and the statistical proper-
ties of the response process affect the results. Such statistical properties
must be considered when planning multiple-cue probability studies or
interpreting their results. Formulas are provided specifying the limita-
tions placed on achievement (inferential accuracy) by statistical factors
in certain environments which may be arranged by an E. Illustrations
of the application of the analysis from the 2 most representative cases
are provided: 1 for probability learning, and 1 for clinical inference.


Within 20 years after Brunswik 
(1943), Lewin (1943), and Hull (1943) 
debated the appropriateness of prob- 
ability laws in psychology, the study 
of probabilistic behavior has become 
one of the most vigorous pursuits in 
psychology. Whatever the upshot of 
the specific conclusions reached by 
these three, Brunswik's general argu- 
ment that probability relationships 
are not only scientifically respectable, 
but a useful point of departure for 
valuable theoretical and empirical 
work has now been accepted, explicitly 
or otherwise—especially in the field 
of learning. 

The argument that the environment 


1 This article is based on research at the 
Behavior Research Laboratory supported by a 
grant from the National Institute of Mental 
Health (Project Code Number: M-4977, 
Clinical Inference and Cognitive Processes) 
and the Council on Research and Creative
Activity of the University of Colorado. It is 
a part of the general program of research on 
inferential processes now in progress at the 
Behavior Research Laboratory. Data were 
analyzed through the facilities of Western 
Data Processing Center, at University of 
California, Los Angeles. 

2 The authors are indebted to Frederick J. 
Todd, Cameron R. Peterson, Daniel E. 
Bailey, Leon Rappoport, and M. M. Siddiqui 
for their assistance, 
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is an uncertain, probabilistic one, an 
that inferences about the environment 
are made on the basis of probabilistic 
data is fundamental to Brunswik's 
(1952, 1956) probabilistic functional- 
ism (see also Postman & Tolman, 
1959). To use the size constancy 
paradigm, estimates of object size are 
based on cues of differing ecological 
validities, where ecological validity is 
the correlation between the cue (e.g.,
linear perspective, interposition, etc.) 
and the distal variable (size). Cues 
also differ in their utilization by the 
subject, where cue utilization is the 
correlation between cue and inference. 
Finally, achievement is indicated by 
the correlation between response and 
distal variable. 
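
The three indices just defined (ecological validity, cue utilization, and achievement) are ordinary product-moment correlations and can be computed directly from trial-by-trial data. The following is a minimal sketch in Python, assuming NumPy is available; the simulated cue weights, noise levels, and the function name `lens_indices` are illustrative assumptions and do not come from any experiment discussed here.

```python
import numpy as np

def lens_indices(cues, distal, response):
    """Return (ecological validities, cue utilizations, achievement).

    cues: array of shape (n_trials, n_cues)
    distal, response: arrays of shape (n_trials,)
    """
    n_cues = cues.shape[1]
    # Ecological validity: correlation of each cue with the distal variable.
    validities = np.array(
        [np.corrcoef(cues[:, i], distal)[0, 1] for i in range(n_cues)]
    )
    # Cue utilization: correlation of each cue with the subject's response.
    utilizations = np.array(
        [np.corrcoef(cues[:, i], response)[0, 1] for i in range(n_cues)]
    )
    # Achievement: correlation between response and distal variable.
    achievement = np.corrcoef(distal, response)[0, 1]
    return validities, utilizations, achievement

# Hypothetical data: three cues, a noisy distal variable, a noisy judge.
rng = np.random.default_rng(0)
cues = rng.standard_normal((500, 3))
distal = cues @ np.array([0.6, 0.4, 0.2]) + 0.5 * rng.standard_normal(500)
response = cues @ np.array([0.5, 0.5, 0.1]) + 0.5 * rng.standard_normal(500)

validities, utilizations, achievement = lens_indices(cues, distal, response)
print(validities, utilizations, achievement)
```

Comparing `validities` with `utilizations` cue by cue is the "matching" question taken up in the cases below.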

Brunswik used the analogy of a 
convex lens to describe this situation 
(see Figure 1). Because both the 
ecological and the organismic system 
involve a criterion (distal variable 
and response, respectively), and because
several variables (cues) are
correlated in various amounts with
both distal variable and response,
multiple correlation methods may be
applied to the analysis of the interaction
of the two systems, and indeed
this was intended by Brunswik
(1956, p. 28; Hammond, 1955; Todd,
1954).

This paper directs itself to the ques- 
tion of the extent to which the subjects’ 
achievement of a distal variable is de- 
termined by the statistical character- 
istics of the environmental and organ- 
ismic systems. Put otherwise, to what 
extent is inferential accuracy de- 
termined by the correlations between 
cue and distal variable, and between 
the cues themselves, and to what 
extent is inferential accuracy de- 
termined by the organism’s utilization 
of cues, i.e., the correlations between
cue and response? The answers to
these questions are here determined 
within the framework of multiple 
correlational analysis. 

FIG. 1. Brunswik's lens model.

These questions are fundamental
to probabilistic theories of behavior
which do not restrict themselves to
single-cue situations, for three reasons:
without a knowledge of the limitations
placed on achievement by the sta- 
tistical characteristics of a given 
ecological or response system, it is 
impossible (a) to evaluate a subject's 
achievement within that system, (b) 
to compare a subject's achievement 
across ecological situations which have 
different statistical characteristics, and 
(c) to understand why the subject's 
achievement was as high or low as 
it was. 

Suppose, for example, in a multiple- 
cue probability learning situation the 
asymptotic level of learning indicates 
a correlation of .5 between response 
and distal variable. Lacking informa- 
tion concerning the statistical restric- 
tions placed on the situation by the 
intercorrelations between the cues, 
and the correlations between the cues 
and the distal variable, it would be 
impossible to say whether the subjects 
had reached a statistical limit or an 
organismic limit. Suppose further,
in a multiple-cue probabilistic percep- 
tual experiment on size constancy, the 
correlation between response and dis- 
tal variable is higher in a four-cue 
than in a three-cue situation. To 
what extent was the greater achieve- 
ment permitted by the change in 
statistical characteristics due to the 
arrangement of the stimuli? Lacking 
specific knowledge of the limitations 
on achievement imposed by the corre- 
lations between the cues and between 
cues and distal variables, it would be 
impossible to say whether the higher 
achievement in the four-cue situation 
was due to the increased number of 
cues, or to a relaxation in statistical 
restrictions on achievement brought 
about by the change in relations 
among the stimulus variables. Sup- 
pose once more that one wants to 
understand why a clinical psychol- 
ogist’s inference from test data to 
patient behavior was not highly ac- 
curate. Without specific knowledge 
of the limitations placed on the clini- 
cian’s accuracy by his manner of 
utilizing the cues provided, it would 
be difficult to suggest ways in which 
the accuracy of his predictions might 
be improved. 

Knowledge of the above-mentioned 
statistical characteristics does not, of 
course, exhaust the prerequisites for 
understanding learning, perception, 
or clinical inference, but it is, we 
believe, one important methodological 
consideration heretofore given too 
little attention. The importance of 
such considerations may be further 
illustrated by considering the experi- 
menter’s choices in arranging stimulus 
displays, and in evaluating response 
systems. 


Choices in Stimulus Arrangements 


Whenever a psychologist wishes to
arrange a laboratory experiment which
includes more than one independent
variable, and includes probability
relations among these variables, he must
make a choice with respect to at least
the following conditions: (a) the number
of stimulus variables to be included;
(b) the degree of relation to be
arranged between each of the stimulus
variables; and (c) if a distal variable
is included, the degree of relation
between the stimulus variables (cues)
and the distal variable. (Brunswik,
of course, argued that all such choices
must be guided by a consideration of
the situation to which the experimenter
wishes to generalize and suggested
that the subject's natural
habitat should determine such
considerations [see also Simon, 1956].)

In addition to theoretical and prac- 
tical considerations, however, the 
above choices must also involve the 
statistical aspects of the problem. 
Leaving aside the question of the 
number of stimulus variables, we shall 
explore the statistical problems which 
arise from Choices b and c above.
This implies that multiple-cue probability
studies carried out in terms of
multiple correlational analysis will
require the experimenter to decide on
the size of the multiple R controlling
the stimulus arrangement. In general
terms, the environmental function-form 
should be considered. 


Evaluating Response Systems 


Assuming that an experimenter has 
decided upon a certain stimulus ar- 
rangement in a multiple-cue prob- 
ability study, how shall the response 
system be evaluated? Measurement 
of achievement can, of course, be made 
fairly easily by comparing correspond- 
ing values of the distal variable and 
responses made to it—a comparison 
which can usually be made with a 
product-moment correlation (although one must
be mindful of Cronbach’s, 1955,
analysis of interpersonal perception
scores). A more difficult problem is: 
how did the subject's utilization of 
the stimulus information affect his 
achievement? Brunswik (1956) as- 
sumed that the subject’s utilization 
coefficients should match the ecologi- 
cal validities of the cues. That is, 
the correlation between each cue and 
response should equal the correlation 
between each cue and the distal 
variable, and he certainly assumed 
that matching would be related to 
high achievement, e.g., “Ideally, cues
should be utilized in accordance with
their validity [p. 141].” The theoretical
and empirical work by probability
learning theorists concerning
the matching of response probabilities 
with stimulus probabilities (e.g., Estes, 
1959) leaves no doubt but that the 
question has been considered worthy 
of investigation. We shall explore the 
statistical aspects of matching be- 
havior within the multiple correlation 
framework. Within the idiom of 
probabilistic functionalism, we shall 
explore the statistical aspects of the 
match between environmental func- 
tion-form and the process form of the 
subject’s response system. 


Linear versus Nonlinear Relationships 


There is one further aspect of 
multiple-cue probability studies to be 
considered—the linearity-nonlinearity 
question, i.e., the component-pattern 
issue. Experimenters must decide to 
what extent the relations among the 
cues should be linear or patterned, 
and they must be prepared to examine 
the extent to which the subject's be- 
havior can be explained by a linear 
or nonlinear model (Estes, 1959; 
Hammond, 1955; Hoffman, 1960; 
Meehl, 1960; Sarbin, Taft, & Bailey, 
1960; and others). This matter will 
also be considered, and relevant data
from clinical psychologists will be
examined.


Cases to be Considered 


We have taken a Brunswikian 
posture toward multiple-cue prob- 
ability studies which leads us to 
examine six cases which arise from 
(a) choices in stimulus arrangements, 
(b) evaluation of response systems, 
and (c) the matter of linearity and 
nonlinearity. The six cases are: 

Case I: Cues have intercorrelations
of zero, and the subject matches his 
cue utilization coefficients with the 
ecological validities of the cues. 

Case II: Cues are intercorrelated, 
and the subject matches utilization 
coefficients with ecological validities. 

Case III: Cues have intercorrela- 
tions of zero, and the subject's utiliza- 
tion coefficients do not match eco- 
logical validities. 

Case IV: Cues are intercorre- 
lated, and the subject's utilization 
coefficients do not match ecological 
validities. 

Case V: Cues have intercorrelations
of zero, and the differences between 
the subject's utilization coefficients 
and ecological validities are evaluated. 

Case VI: Same as Case V except 
that cues are intercorrelated. 

Furthermore, in the development of
the six cases described above, we shall
consider two subcases: (a) one in
which the situation is described by
linear relationships among all the
variables, and (b) one in which nonlinear
relations must be investigated.
The nature of the presentation is
indicated in Table 1.


TABLE 1

NATURE OF PRESENTATION OF THE ANALYSIS OF SIX RELATED CASES

Relation between ecological          Relations between cues
validities ($r_{e,i}$) and           ------------------------------------------------
utilization coefficients             Intercorrelations = 0    Intercorrelations ≠ 0
($r_{s,i}$)                          ($r_{x_i x_j} = 0$)      ($r_{x_i x_j} \neq 0$)

Matched                              Case I                   Case II
Not matched                          Case III                 Case IV
Degree of matching                   Case V                   Case VI


Definitions

Assume that

$$Y_e,\; Y_s,\; X_1,\; \ldots,\; X_n \tag{1}$$

are standardized random variables
(i.e., their mean is zero and their
variance is 1) where (see Figure 1):

$Y_e$ is the distal variable in the environment

$Y_s$ is the subject's estimate of the distal variable

$X_1, \ldots, X_n$ are the cue variables

This assumption does not decrease
the applicability of the results since
any random variable $x$ may be
“standardized” by replacing it with
the random variable $X = (x - m)/s$,
where $m$ stands for the mean of $x$ and
$s^2$ is the variance of $x$.

$$\text{Correlation}(Y_e, X_i) = r_{e,i}, \qquad \text{Correlation}(Y_s, X_i) = r_{s,i} \tag{2}$$

Let $R_e$ be the multiple correlation of $Y_e$ and $(X_1, \ldots, X_n)$
on the environment side, and let $R_s$ be the multiple correlation
of $Y_s$ and $(X_1, \ldots, X_n)$ on the subject's side. [3]

Under these assumptions, $Y_e$ and
$Y_s$ have the following representation:

$$Y_e = (\beta_{e,1}X_1 + \cdots + \beta_{e,n}X_n) + Z_e$$
$$Y_s = (\beta_{s,1}X_1 + \cdots + \beta_{s,n}X_n) + Z_s \tag{4}$$

where the quantities in parentheses
represent the regressions, and $Z_e$ and
$Z_s$ are the residuals. $Z_e$ and $Z_s$ are
uncorrelated with any of the regression
variables $(X_1, \ldots, X_n)$.

Then

$$R_e^2 = \beta_{e,1}r_{e,1} + \cdots + \beta_{e,n}r_{e,n} = \sum_i \beta_{e,i}\, r_{e,i}$$
$$R_s^2 = \beta_{s,1}r_{s,1} + \cdots + \beta_{s,n}r_{s,n} = \sum_i \beta_{s,i}\, r_{s,i} \tag{5}$$

Also, $E Z_e = 0$, $E Z_s = 0$ (where $E$
indicates “expected value”),

$$\operatorname{var} Z_e = 1 - R_e^2, \qquad \operatorname{var} Z_s = 1 - R_s^2 \tag{6}$$

$$r_a = E(Y_e Y_s) = \operatorname{Cov}(Y_e, Y_s), \tag{7}$$

and $r_a$ is a correlation coefficient represented
by the lens model in Figure 1
as “achievement.”

Let the correlation $(Z_e, Z_s) = C$.
Then $C$ is the partial correlation between
$Y_e$ and $Y_s$ when the effects of
$X_1, \ldots, X_n$ are eliminated, and may
be described as the correlation be- 
tween the variance unaccounted for 
by the multiple correlation in the 
ecology and the variance unaccounted 
for by the multiple correlation in the 
subject's response system. 

The assumption is often made that 
residual variance is random, and, 
therefore, that the correlation between 
two sets of residual variances must be 
zero. This assumption is not a neces- 
sary feature of regression analysis 
(Siddiqui, 1960). The residual vari- 
ances may be due to random error or 
to nonlinearity (pattern) in the sys- 
tem. Therefore, randomness is but 
one of the many possible reasons for 
the existence of residual variance; to 
assume that randomness is the only 
reason overlooks the patterning or 
nonlinearity that might account for 
residual variance. (This point is dis- 
cussed below in more detail.) 

The variable C is important because 
achievement may be a result of the 
subject’s perception of a nonlinear 
relation between cues and distal variable.
If such a nonlinear relation developed
between cues and response,
it would not, of course, be reflected 
in the subject’s multiple correlation 
coefficient. Although the value of C 
would not define the subject's specific 
nonlinear method of combining data, 
it would provide the following in- 
formation: 


1. If C were high and positive, 
this would indicate that the subject 
is correctly utilizing a nonlinear rela- 
tionship between the cues and the 
distal variable. “Correctly” here
means that the subject's dependency 
on the cues follows the same non- 
linear form that is present in the cue- 
to-distal-variable relationship. This 
would contribute to high achievement. 

2. If C were high and negative,
this would indicate that the subject is
using a negative function of the nonlinear
relationship between cues and
distal object. This would contribute
to low achievement.

3. If C were zero, this would indi- 
cate that the subject’s residual vari- 
ance is derived from the use of a 
nonlinear function which is not the 
same as that in the environment, or 
that one or both of the residual 
variances are random. 

4. Naturalistic studies provide a 
special case, for cues may be present 
in the environment and the subject 
may be utilizing them although the 
experimenter may not be aware of 
their existence. Thus, in naturalistic
experiments where the subject can
perceive cues other than those denoted
by the experimenter, the subject
may be utilizing a cue which has
a high linear correlation with the distal
variable. A high value of C would
therefore be obtained, and would be
misinterpreted. This point has been
discussed by Hoffman (1960). In a
contrived stimulus arrangement, however,
the intrusion of valid cues unknown
to the experimenter is not
likely.


In all cases, the usefulness and sig- 
nificance of the variable C will 
decrease as the value of the environ- 
mental R approaches 1.00. 

The limits of accuracy ($r_a$) will now
be derived for the six cases. Each
case will be developed first under the
condition that $C = 0$, and then under
the condition that $C \neq 0$.


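Case I

Conditions:
(a) $r_{x_i x_j} = 0$, $i \neq j$; and
(b) $r_{e,i} = r_{s,i}$ for all $i$.

The reader may be reminded that
the relation between correlation coefficients
and beta weights is as follows:

$$\beta_{e,1} = \frac{r_{e,1} - r_{e,2}\, r_{12}}{1 - r_{12}^2}$$

where $r_{e,1}$ is the correlation between the distal variable $Y_e$ and the cue $X_1$,
$r_{e,2}$ is the correlation between the distal variable $Y_e$ and the cue $X_2$, and
$r_{12}$ is the intercorrelation between the cues $X_1$ and $X_2$.

This expression is, of course, more
complicated when there are more than
two $X_i$. For $n > 2$, the reader should
consult McNemar (1962). Also note
that when the $X_i$ are independent,
the intercorrelations are zero, and
$\beta_{e,i} = r_{e,i}$.

When the $X_i$ are independent, and
matching occurs, i.e., if the subject’s
utilization coefficients are the same
as the ecological validities, then

$$R_e^2 = R_s^2 = R^2$$

and

$$r_{e,i} = r_{s,i} = r_i$$

and

$$\begin{aligned}
r_a &= E(Y_e Y_s) \\
    &= E\big[(\beta_{e,1}X_1 + \cdots + \beta_{e,n}X_n + Z_e)(\beta_{s,1}X_1 + \cdots + \beta_{s,n}X_n + Z_s)\big] \\
    &= E\Big[\sum_{i,j} \beta_{e,i}\beta_{s,j} X_i X_j + Z_e Z_s\Big] \\
    &= \sum_{i,j} \beta_{e,i}\beta_{s,j}\, E(X_i X_j) + E(Z_e Z_s),
\end{aligned}$$

the cross terms in $X_i Z_e$ and $X_i Z_s$ having zero expectation because
the residuals are uncorrelated with the cues. Since $E(X_i X_j)$ in this
case $= 0$ for $i \neq j$, and

$$E(X_i X_i) = E(X_i^2) = 1,$$

then

$$E(Y_e Y_s) = \sum_i \beta_{e,i}\beta_{s,i} + E(Z_e Z_s)$$

but since in this case

$$\beta_{e,i} = r_i = \beta_{s,i}$$

we have

$$r_a = r_1^2 + r_2^2 + \cdots + r_n^2 + E(Z_e Z_s) \tag{8}$$

where

$$E(Z_e Z_s) = C\, \sigma_{Z_e}\, \sigma_{Z_s}$$

and

$$\sigma_{Z_e} = \sqrt{1 - R_e^2}, \qquad \sigma_{Z_s} = \sqrt{1 - R_s^2}.$$

1. If $C = 0$,
then

$$r_a = r_1^2 + r_2^2 + \cdots + r_n^2 = R^2. \tag{9}$$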


In this case, as in an orthogonal 
component, factorial design, the cues 
are uncorrelated. Also, the subject 
matches utilization coefficients with 
validity coefficients. His multiple 
correlation coefficient, therefore, is 
equal to that in the environment. 
The limits of achievement are determined
by the size of the (equal)
multiple correlation coefficients ($R_e^2$
and $R_s^2$) and the correlation (C) between
the variance unaccounted for
by $R_e^2$ and the variance unaccounted
for by $R_s^2$. If the correlation between
these variances is 0 (as in Formula 9)
then achievement is completely determined
by the size of the squared
multiple correlation coefficients. In
short, should a subject adapt to a 
completely linear environment by 
matching his utilization coefficients 
with the ecological validities of the 
cues, his achievement will be pre- 
cisely the square of the multiple corre- 
lation of the environment. Although 
the result may be perfectly obvious 
algebraically, it has not been obvious 
in experimental settings. Variation in 
environmental R, therefore, should 
become an important experimental 
variable in multiple-cue probability
learning and perceptual experiments. 
Without such variation, results and 
conclusions remain highly situation- 
bound. 


2. If $C \neq 0$,
then from Formula 8

$$r_a = R^2 + C(1 - R^2). \tag{10}$$

If C is positive, then, achievement
($r_a$) will be increased as a function of
the value of C. That is, to the extent
that a pattern, or nonlinear arrangement,
exists among the cues in the
environment, and to the extent that
S utilizes this pattern correctly, his
achievement will increase. If C is
negative, $r_a$, of course, will be reduced
and may even become negative.
Formula 10 provides the mathematical
explication of our earlier remarks
about the value of investigating C,
the nonlinear aspect of the achievement
variable. It also illustrates the
fact that it is possible to partition the
component and pattern determinants
of the subject’s response. Since
$-1 \le C \le 1$, the limits of achievement
in this case are given by

$$2R^2 - 1 \le r_a \le 1. \tag{11}$$
Case II

Conditions:
(a) $r_{x_i x_j} \neq 0$, $i \neq j$;
(b) $r_{e,i} = r_{s,i}$ for all $i$.

When the $X_i$ are not independent,
and the utilization coefficients match
the ecological validity coefficients,
then

$$r_{e,i} = r_{s,i} = r_i \quad \text{for all } i$$

and

$$\beta_{e,i} = \beta_{s,i} = \beta_i \quad \text{for all } i,$$

because the $\beta_i$ are completely determined
by the $r_i$ and the $r_{x_i x_j}$.
(See also Footnote 3.)

Thus

$$R_e^2 = R_s^2 = R^2;$$

also,

$$R^2 = \beta_1 r_1 + \cdots + \beta_n r_n = \sum_i \beta_i r_i$$

and

$$r_a = \sum_i \beta_i r_i + C(1 - R^2). \tag{12}$$

1. If $C = 0$,
then

$$r_a = \sum_i \beta_i r_i = R^2. \tag{13}$$

This case removes the restriction
that the cues must be uncorrelated
with one another. Matching still
prevails, however, which leaves us
with the same result as in the first
subcase of Case I. Leaving cue variables
“tied” in their natural ecological
form, then, has no effect on the limits
of achievement (providing matching
occurs and $C = 0$). We now consider
the subcase where $C \neq 0$.

2. If $C \neq 0$, then because $-1 \le C \le 1$,
from Formula 12, the limits of $r_a$ are:

$$\sum_i \beta_i r_i - (1 - R^2) \le r_a \le \sum_i \beta_i r_i + (1 - R^2). \tag{14}$$

Since $\sum_i \beta_i r_i = R^2$, the value of C
(i.e., the degree of correlation between
the nonlinear aspect of the environment
and the nonlinear aspect of the
subject’s response process) will add
to or subtract from achievement due
to linearity. More specifically, if
there is a pattern in the environment
which is correctly used by the subject,
then achievement will be increased
precisely by the quantity $C(1 - R^2)$.

3 The beta weight enters here for the first
time as a quantity not equal to r in the computation
of each R and therefore each $r_a$.
This is due to Condition (a) above, which
indicates that the cues (the $X_i$) are intercorrelated,
not uncorrelated as they were in
Case I. The beta weight for the ecological R
combines an ecological validity coefficient
($r_{e,i}$) with the intercorrelations of each cue
with the ith cue. The beta weight for the
subject’s response process-form combines
a utilization coefficient ($r_{s,i}$) and the intercorrelations
of each cue with the ith cue.
When the intercorrelations are all zero, then
the beta weight $\beta_i$ equals the corresponding
validity coefficient $r_i$, and $\beta_i r_i = (r_i)^2$.
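
When the cues are intercorrelated, the beta weights for any number of cues can be obtained by solving the normal equations rather than by the two-cue expression given under Case I. The following sketch, assuming NumPy, solves $R_{xx}\beta = r$ for standardized variables and then evaluates Formula 12; the intercorrelation of .5 and the matched coefficients of .6 and .4 are hypothetical values, and the function names are not taken from the source.

```python
import numpy as np

def beta_weights(r_xx, r_xy):
    """Standardized regression weights: solve R_xx @ beta = r_xy,
    where R_xx holds the cue intercorrelations and r_xy the
    correlations of each cue with the criterion."""
    return np.linalg.solve(np.asarray(r_xx, dtype=float),
                           np.asarray(r_xy, dtype=float))

def squared_multiple_r(r_xx, r_xy):
    """Formula 5: R^2 = sum_i beta_i * r_i."""
    beta = beta_weights(r_xx, r_xy)
    return float(beta @ np.asarray(r_xy, dtype=float))

def case2_achievement(r_xx, r, C=0.0):
    """Formula 12 (matching, intercorrelated cues):
    r_a = sum_i beta_i r_i + C(1 - R^2)."""
    R2 = squared_multiple_r(r_xx, r)
    return R2 + C * (1.0 - R2)

# Hypothetical two-cue ecology with an intercorrelation of .5.
r_xx = [[1.0, 0.5],
        [0.5, 1.0]]
r = [0.6, 0.4]

beta = beta_weights(r_xx, r)
R2 = squared_multiple_r(r_xx, r)
r_a = case2_achievement(r_xx, r, C=0.3)
print(beta, R2, r_a)
```

For two cues the solved weights agree with the closed-form expression, for example $\beta_1 = (.6 - .4 \times .5)/(1 - .25)$.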
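Case III

Conditions:
(a) $r_{x_i x_j} = 0$, $i \neq j$; and
(b) $r_{e,i} \neq r_{s,i}$ for all $i$.

When the $X_i$ are independent, then
the $\beta_{e,i}$ and $\beta_{s,i}$ in Formula 5 reduce
to $r_{e,i}$ and $r_{s,i}$, respectively, and
Formula 5 becomes

$$R_e^2 = \sum_i r_{e,i}^2, \qquad R_s^2 = \sum_i r_{s,i}^2; \tag{5a}$$

then,

$$r_a = E(Y_e Y_s) = r_{e,1}r_{s,1} + r_{e,2}r_{s,2} + \cdots + r_{e,n}r_{s,n} + C\sqrt{(1 - R_e^2)(1 - R_s^2)}. \tag{15}$$

1. If $C = 0$, then from Schwarz's
inequality 4

$$\left| r_{e,1}r_{s,1} + r_{e,2}r_{s,2} + \cdots + r_{e,n}r_{s,n} \right| \le \sqrt{\sum_i r_{e,i}^2 \sum_i r_{s,i}^2} = R_e R_s. \tag{16}$$

This gives the following bounds on $r_a$:

$$-R_e R_s \le r_a \le +R_e R_s. \tag{17}$$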


We conclude then, that under 
these circumstances, achievement is 
bounded by the product of the two 
multiple correlations involved. Note 
that should either multiple R be 1.00, 
the bounds on achievement are plus 
and minus the value of the other R. 
This observation is important because 
investigators frequently present the 
subject with an ecology which is completely
determined, i.e., $R_e = 1.00$.
(An example will be presented under 
Case V.) Thus Brunswik’s emphasis 
on the importance of environmental 
stipulations is borne out. 

Also, note that if there is no possibility
for a positive C in a given environment,
a subject may increase his
achievement by maintaining a response
system which is more linear
than the environment. In other
words, under the conditions of this
case, if $R_e$ is less than 1.00, then
matching cue utilization coefficients
with validity coefficients will give as
an upper limit $r_a = R_e^2$, while a
completely linear response system
($R_s^2 = 1.00$) will give the subject an
upper limit of $r_a = R_e$, which, of
course, is higher than $R_e^2$ when $R_e^2$ is
less than 1.00. However, see Cases V
and VI for a more detailed treatment
of this point.

4 The application of Schwarz’s inequality
to this problem is shown in the Appendix to
this article.

Formula 17 also illustrates the im- 
portance of the experimental varia- 
tion of the multiple correlation in the 
environment. Suppose that the linear 
multiple correlation model does in- 
deed account for all of the variation 
in a subject's inferential behavior; 
then the upper bound on his maximum 
achievement is reduced to the extent 
that the environmental R departs 
from unity. Should one experimenter 
arrange his stimulus display so that 
the environmental R = eand a 
second experimenter set his environ- 
mental R = .3, then the upper bound 
on the subjects’ potential achieve- 
ments would be different in the two 
experiments, even though the multiple 
regression equation provided an ade- 
quate account of all subjects’ inferen- 
tial behavior in both situations. In 
short, adequate environmental con- 
trol in multiple-cue probability studies 
requires the specification and manipu- 
lation of the environmental multiple 
R. The authors know of few such 
considerations in the literature. (See,
however, Azuma & Cronbach, 1961; 
Lee & Tucker, 1962; Rappoport, 
1961; Smedslund, 1955; Summers, 
1962; Todd, 1954.) 
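
The bound in Formula 17, and the comparison between a matching and a completely linear response system, can be illustrated with a few lines of plain Python. This is only a sketch for uncorrelated cues with $C = 0$; the validity and utilization coefficients are hypothetical, and the function names are assumptions introduced for the example.

```python
import math

def multiple_r(correlations):
    """Formula 5a: for uncorrelated cues, R is the square root of the
    sum of squared cue-criterion correlations."""
    return math.sqrt(sum(r * r for r in correlations))

def case3_achievement(r_e, r_s):
    """Formula 15 with C = 0: r_a = sum_i r_e,i * r_s,i."""
    return sum(a * b for a, b in zip(r_e, r_s))

def case3_bound(r_e, r_s):
    """Formula 17: |r_a| cannot exceed R_e * R_s."""
    return multiple_r(r_e) * multiple_r(r_s)

# Hypothetical ecology and a subject who weights the cues the wrong way round.
r_e = [0.6, 0.3]
r_s = [0.3, 0.6]

r_a = case3_achievement(r_e, r_s)   # .18 + .18
bound = case3_bound(r_e, r_s)       # R_e * R_s

# Upper limits discussed in the text for an environment with R_e < 1:
R_e = multiple_r(r_e)
matching_limit = R_e ** 2           # matching, C = 0 (Formula 9)
linear_limit = R_e * 1.0            # completely linear responder, R_s = 1
print(r_a, bound, matching_limit, linear_limit)
```

With these assumed coefficients the subject's achievement of about .36 falls below the bound of about .45, and the limit for a completely linear responder (about .67) exceeds the limit under matching (about .45).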

2. If $C \neq 0$, then since $-1 \le C \le 1$,
the limits of $r_a$ (using Formula 15)
are:

$$\sum_i r_{e,i}r_{s,i} - \sqrt{(1 - R_e^2)(1 - R_s^2)} \le r_a \le \sum_i r_{e,i}r_{s,i} + \sqrt{(1 - R_e^2)(1 - R_s^2)}. \tag{18}$$


According to Formula 18, if the
residuals $Z_e$ and $Z_s$ are correlated,
the limits of achievement are determined
by the values of the ecological
validities ($r_{e,i}$), the utilization
coefficients ($r_{s,i}$), the multiple Rs ($R_e$
and $R_s$), and C. Most important, the
complexity of Formula 18 means
that when $C \neq 0$, the limits placed
on the subjects’ achievement by the 
investigator will not be apparent to 
him until he has made the calculations 
indicated. Therefore, they must be 
made in order to evaluate the subject's 
achievement. 


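Case IV

Conditions:
(a) $r_{x_i x_j} \neq 0$, $i \neq j$;
(b) $r_{e,i} \neq r_{s,i}$ for all $i$.

When the $X_i$ are not independent,
and the utilization coefficients do not
match the ecological validity coefficients,
then

$$R_e^2 = \beta_{e,1}r_{e,1} + \cdots + \beta_{e,n}r_{e,n}$$

and

$$R_s^2 = \beta_{s,1}r_{s,1} + \cdots + \beta_{s,n}r_{s,n}.$$

Since, from Formula 4,

$$Y_e = \sum_i \beta_{e,i}X_i + Z_e$$

$$Y_s = \sum_i \beta_{s,i}X_i + Z_s$$

and $Y_e$ and $Y_s$ are normalized, therefore
$r_a = E(Y_e Y_s)$,

$$r_a = \sum_i \sum_j \beta_{e,i}\beta_{s,j}\, E(X_i X_j) + E(Z_e Z_s)$$

and

$$r_a = \sum_i \sum_j \beta_{e,i}\beta_{s,j}\, r_{x_i x_j} + C\, \sigma_{Z_e}\sigma_{Z_s}$$

and since

$$\sum_j \beta_{e,j}\, r_{x_i x_j} = \beta_{e,1}r_{x_i x_1} + \beta_{e,2}r_{x_i x_2} + \cdots + \beta_{e,n}r_{x_i x_n} = r_{e,i},$$

then,

$$r_a = \sum_i \beta_{s,i}\, r_{e,i} + C\sqrt{(1 - R_e^2)(1 - R_s^2)}. \tag{19}$$

1. If $C = 0$,
then

$$r_a = \sum_i \beta_{s,i}\, r_{e,i} = \sum_i \beta_{e,i}\, r_{s,i}. \tag{20}$$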


Achievement in this case is deter- 
mined by the ecological validity co- 
efficients and the intercorrelations 
between the cues which may be 
arranged by the experimenter, and by 
the correlations between cue and 
inference which are determined by 
the subject. Obviously then, the 
experimenter, by his choice of validity 
coefficients and intercorrelations, can 
greatly affect the subject’s achieve- 
ment. For achievement depends upon 
a combination of the environmental 
validity coefficients, and the inter- 
correlations between cues as well as 
the subject’s cue utilization coeffi- 
cients, all of which are contained 
in the expressions $\sum_i \beta_{s,i}\, r_{e,i}$ and
$\sum_i \beta_{e,i}\, r_{s,i}$.

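2. If $C \neq 0$, then as $-1 \le C \le 1$,
the limits of $r_a$ are:

$$\sum_i \beta_{s,i}\, r_{e,i} - \sqrt{(1 - R_e^2)(1 - R_s^2)} \le r_a \le \sum_i \beta_{s,i}\, r_{e,i} + \sqrt{(1 - R_e^2)(1 - R_s^2)}. \tag{21}$$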


Thus, if an experimenter happens to 
have arranged the ecology in such a 
way that achievement will be low if 
the subject considers only linear rela- 
tionships between cues and the distal 
variable, then high achievement will 
be due to a high correlation between 
the residual pattern variance in the 
environment and the residual pattern
variance in the subject's response. 
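
The general decomposition of achievement into a linear component and a residual (pattern) component can be verified on data, because least-squares residuals are exactly uncorrelated with the cues within the sample. The sketch below, assuming NumPy, simulates intercorrelated cues with a shared nonlinear pattern and compares achievement computed directly with achievement computed as the linear term plus $C\sqrt{(1 - R_e^2)(1 - R_s^2)}$. The data-generating weights, the product term used as the "pattern," and all function names are hypothetical choices made for illustration.

```python
import numpy as np

def standardize(a):
    """Rescale to mean 0 and variance 1 (population variance, ddof=0)."""
    return (a - a.mean(axis=0)) / a.std(axis=0)

def lens_decomposition(cues, distal, response):
    """Return (r_a computed directly, r_a from the decomposition,
    linear component, C)."""
    X = standardize(cues)
    ye = standardize(distal)
    ys = standardize(response)
    n = len(ye)
    r_xx = X.T @ X / n                    # cue intercorrelations
    r_e = X.T @ ye / n                    # ecological validities
    r_s = X.T @ ys / n                    # utilization coefficients
    beta_e = np.linalg.solve(r_xx, r_e)   # beta weights, environment side
    beta_s = np.linalg.solve(r_xx, r_s)   # beta weights, subject side
    R2_e = beta_e @ r_e                   # Formula 5
    R2_s = beta_s @ r_s
    z_e = ye - X @ beta_e                 # residuals
    z_s = ys - X @ beta_s
    C = np.corrcoef(z_e, z_s)[0, 1]       # correlation between residuals
    linear = beta_s @ r_e                 # linear component of achievement
    r_a_formula = linear + C * np.sqrt((1 - R2_e) * (1 - R2_s))
    r_a_direct = np.mean(ye * ys)
    return r_a_direct, r_a_formula, linear, C

# Hypothetical data: intercorrelated cues and a pattern the subject also uses.
rng = np.random.default_rng(1)
n = 2000
cues = rng.standard_normal((n, 3))
cues[:, 1] += 0.5 * cues[:, 0]            # make cues 1 and 2 intercorrelated
pattern = cues[:, 0] * cues[:, 2]         # a nonlinear (configural) term
distal = cues @ np.array([0.5, 0.3, 0.2]) + 0.8 * pattern + 0.3 * rng.standard_normal(n)
response = cues @ np.array([0.2, 0.5, 0.1]) + 0.8 * pattern + 0.3 * rng.standard_normal(n)

r_direct, r_formula, linear, C = lens_decomposition(cues, distal, response)
print(r_direct, r_formula, linear, C)
```

In this simulated case the two values of $r_a$ agree to rounding error, and C is large and positive because the subject's residual variance follows the same pattern as the environment's.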

The first four cases present the logic 
for determining the limits of accuracy 
when a subject either matches or does 
not match utilization coefficients with 
validity coefficients, and the formulas
and rationale necessary to the solu- 
tions of the more general situations to 
be considered in Cases V and VI. 
Since perfect matching would prob- 
ably occur very infrequently in any 
empirical situation, and nonmatching 
could occur to any extent, the more 
realistic approach of Cases V and VI 
involving the degree of matching is 
considered. 


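Case V

Conditions:
(a) $r_{x_i x_j} = 0$, $i \neq j$;
(b) $\sum_i (r_{e,i} - r_{s,i})^2 = d^2$.

As in Formula 5a, when $r_{x_i x_j} = 0$,
then

$$R_e^2 = \sum_i r_{e,i}^2, \qquad R_s^2 = \sum_i r_{s,i}^2.$$

If complete matching occurs, i.e.,
if $r_{e,i} = r_{s,i}$ for all $i$, then $d^2 = 0$, as
in Case I.

However, if the utilization
coefficients do not all match the validity
coefficients, then

$$\begin{aligned}
d^2 &= \sum_i (r_{e,i} - r_{s,i})^2 \\
    &= \sum_i r_{e,i}^2 - 2\sum_i r_{e,i}r_{s,i} + \sum_i r_{s,i}^2 \\
    &= R_e^2 - 2\sum_i r_{e,i}r_{s,i} + R_s^2
\end{aligned}$$

or

$$\sum_i r_{e,i}r_{s,i} = \frac{R_e^2 + R_s^2 - d^2}{2}. \tag{22}$$

From Case III, Formula 15 we know
that

$$r_a = \sum_i r_{e,i}r_{s,i} + C\sqrt{(1 - R_e^2)(1 - R_s^2)}$$

and therefore

$$r_a = \frac{R_e^2 + R_s^2 - d^2}{2} + C\sqrt{(1 - R_e^2)(1 - R_s^2)}. \tag{23}$$

If $d^2 = 0$ (as in Case I) then

$$R_e^2 = R_s^2 = R^2$$

and

$$r_a = R^2 + C(1 - R^2). \tag{24}$$

1. (a) If $C = 0$, and $d^2 = 0$, then
(as in Case I) $r_a = R^2$.

(b) If $C = 0$, and $d^2 \neq 0$, then

$$r_a = \frac{R_e^2 + R_s^2 - d^2}{2}. \tag{25}$$

2. (a) If $C \neq 0$, and $d^2 = 0$, then
(as in Case I, Formula 11)
the limits of $r_a$ are:

$$2R^2 - 1 \le r_a \le 1.$$

(b) If $C \neq 0$, and $d^2 \neq 0$, then
the limits of $r_a$ are:

$$\frac{R_e^2 + R_s^2 - d^2}{2} - \sqrt{(1 - R_e^2)(1 - R_s^2)} \le r_a \le \frac{R_e^2 + R_s^2 - d^2}{2} + \sqrt{(1 - R_e^2)(1 - R_s^2)}. \tag{26}$$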


In this case, the cues are uncorrelated
and achievement is shown to be a
function of the squared differences 5
between the utilization coefficients
and the validity coefficients, as well
as $R_e^2$ and $R_s^2$. As this difference
approaches zero, achievement will
approach $(R_e^2 + R_s^2)/2$, and at the same
time be dependent upon the degree of
correlation of the two residuals. (Note
again that if $R_e = 1.00$, then $r_a$ is
completely determined by
$(R_e^2 + R_s^2 - d^2)/2$.)

5 ... cues ($r_{x_i x_j} = 0$), and any $\beta_i$ is equal to the
corresponding $r_i$ under this condition. Therefore,
given that $r_{x_i x_j} = 0$, then $\beta_i = r_i$.

We turn now to an illustration of 
Case V in which the environmental 
R = 1.00, intercorrelations between 
cues are zero, and where the degree to 
which the subject's utilization coeffi- 
cients match the ecological validity 
coefficients of the cues is considered. 
Summers’ (1962) multiple probability 
learning experiment provides appro- 
priate data. 

In Summers’ experiment there were 
3 cues (color, angle, and area) whose 
correlations with the distal variable 
(ecological validities) were .743, .371, 
and .557. Six different conditions 
provided six combinations of these 
three values with the three cues; e. g., 
in Condition I, the correlations of cue 
with distal object were: color .743; 
angle .557; and area .371; the next 
condition interchanged the correla- 
tions for angle and area, and so forth. 
Different subjects were used for each 
condition. There were six blocks of 
trials in each condition, the last block 
consisting of an extinction series. 
Thus, Summers’ data contain the
necessary conditions for a simple
application of the formulas of Case V,
where $r_{x_i x_j} = 0$, and $\sum_i (r_{e,i} - r_{s,i})^2 = d^2$,
and where $d^2 \neq 0$. The ecological
multiple correlation ($R_e$)
= 1.00. (The example following Case
VI illustrates the situation where
$R_e < 1.00$.)

There are two points to be observed
from the application of the
formulas of Case V to Summers’ data:
(a) $r_a$, the correlation between the
subject’s estimate of the distal variable
and the actual value of it, may
be computed from Formula 25 rather
than from the correlation coefficient
between the judgment and distal
variable from the raw data as Summers
did. If the logic of the derivation
of Formula 25 is correct, $r_a$
computed from Formula 25 should
be exactly the same as $r_a$ computed
by Summers. It is, for all of the 17
subjects whose data we analyzed. (b)
The application of Formula 25 should
throw some light on the nature of a
subject’s performance. We will now
demonstrate that it does. Formula
25 reads, it will be recalled, as follows:

$$r_a = \frac{R_e^2 + R_s^2 - d^2}{2}.$$

The relevant data for Subject No. 6
are presented in Table 2.


TABLE 2

APPLICATION OF EQUATION 25 TO SUBJECT NO. 6 IN SUMMERS’ EXPERIMENT

$R_e^2 = 1.00$
$R_s^2 = .970$ for Block I      $d^2 = 3.002$
$R_s^2 = .203$ for Block V      $d^2 = .634$
$R_s^2 = .212$ for Block VI     $d^2 = .616$

Block I     $r_a = (1.00 + .970 - 3.002)/2 = -.516$
Block V     $r_a = (1.00 + .203 - .634)/2 = .284$
Block VI    $r_a = (1.00 + .212 - .616)/2 = .298$

Note that during the first block of
trials, the manner in which Subject
No. 6 combined the data from the
stimulus array essentially could be
represented by the multiple correlation
procedure; the multiple correlation
($R_s^2$) between the cues and his
response being .970. Thus, Subject
No. 6 functions in almost the identical
manner in which the environment,
about which he must learn, functions.
But note also that although Subject
No. 6’s response process-form closely
matches the form in which the environment
functions, it is badly
applied, the difference between ecological
validities and utilization coefficients
($d^2$) being very large (3.002).
As a result, achievement in the first
block of trials is poor; $r_a$ is a fairly
large negative coefficient, -.516.
However, by the time Block V is
reached, Subject No. 6 has largely
given up a linear, additive procedure
($R_s^2$ = .203 instead of .970); also $d^2$
has been markedly reduced (from
3.002 to .634), and as a result achievement
has increased ($r_a$ now equals
+.284 instead of -.516). No marked
change took place in Block VI. In
short, an increase in achievement
came about as a result of decreasing
the match between the subject’s
response process-form ($R_s^2$) and the
environmental function-form ($R_e^2$),
and increasing the match between
utilization coefficients and ecological
validities.


TABLE 3

APPLICATION OF EQUATION 25 TO SUBJECT NO. 12 IN SUMMERS’ EXPERIMENT

$R_e^2 = 1.00$
$R_s^2 = .523$ for Block I      $d^2 = 2.092$
$R_s^2 = .306$ for Block V      $d^2 = .456$
$R_s^2 = .310$ for Block VI     $d^2 = .379$

Block I     $r_a = (1.00 + .523 - 2.092)/2 = -.284$
Block V     $r_a = (1.00 + .306 - .456)/2 = .425$
Block VI    $r_a = (1.00 + .310 - .379)/2 = .466$

Subject No. 12 (Table 3) behaves in much the same way as Subject No. 6
behaved in Block I and Block V. That is, Subject No. 12 increases his
achievement despite decreasing the match between the environmental
function-form ($R_e^2$) and his response process-form ($R_s^2$), for he
more than makes up for this by increasing the match between utilization
coefficients and ecological validities (reducing $\sum d^2$). In Block VI,
Subject No. 12 maintains the former match (.310 being virtually identical
with .306) but, by further decreasing $\sum d^2$, slightly increases $r_a$
from .425 to .466.

Subject No. 7 provides an illustration of a different kind of performance
by (see Table 4) decreasing a small $\sum d^2$ (from .962 to .019), and
increasing his multiple $R_s^2$ from essentially zero (.003) to a rather
high $R_s^2$ (.783), which increased $r_a$ from near zero (.020) to +.882.
Thus, Subject No. 7 increased his achievement by increasing the match
between his response process-form and the environmental function-form, as
well as increasing the match between ecological validities and utilization
coefficients.

TABLE 4

APPLICATION OF EQUATION 25 TO SUBJECT NO. 7 IN SUMMERS' EXPERIMENT

$R_e^2 = 1.00$
$R_s^2 = .003$ for Block I     $\sum d^2 = .962$
$R_s^2 = .783$ for Block VI    $\sum d^2 = .019$

Block I    $r_a = \frac{1.00 + .003 - .962}{2} = .020$
Block VI   $r_a = \frac{1.00 + .783 - .019}{2} = .882$

In short, our analysis may be used to throw light on a subject's
performance. In the above examples, we could see that subjects can increase
their achievement by different methods, either by increasing the match
between environmental function-form and response process-form, and/or by
increasing the match between utilization coefficients and ecological
validities. Differing possibilities such as these seem simple, and perhaps
obvious. At the same time, they raise fundamental questions: Can
individuals learn to match response process-forms with environmental
function-forms? This seems to be analogous to the learning of principles.
It is certainly what Brunswik would have called “distal focusing.” The
above analysis shows that not only can function-forms be matched, but that
they can be misapplied (Subject No. 6, Block I). Can individuals learn to
match ecological validities with utilization coefficients? This question
seems to concern proximal rather than distal focusing, but its importance
remains. Whatever the case, when examining the performance of an individual
in a multiple-cue probability situation, it will be necessary to consider
both the match between response process-form and environmental
function-form and the match between ecological validities and cue
utilization coefficients. Formula 25 makes this possible.
Case VI 
Conditions: 
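(a) $r_{X_i X_j} \neq 0$, $i \neq j$;
(b) $d_\beta = \sqrt{\sum (\beta_{e,i} - \beta_{s,i})^2}$;
(c) $d_r = \sqrt{\sum (r_{e,i} - r_{s,i})^2}$.

If complete matching of utilization coefficients with validity
coefficients occurs, then

$d_\beta = 0$, and $d_r = 0$.

If utilization coefficients do not match validity coefficients, then

$$r_a = \frac{R_e^2 + R_s^2 - (\beta_{e,i} - \beta_{s,i},\; r_{e,i} - r_{s,i})}{2} + C\sqrt{(1 - R_e^2)(1 - R_s^2)} \qquad [27]$$

or, as $C$ runs from −1 to 1,

$$\frac{R_e^2 + R_s^2 - (\beta_{e,i} - \beta_{s,i},\; r_{e,i} - r_{s,i})}{2} - \sqrt{(1 - R_e^2)(1 - R_s^2)} \le r_a \le \frac{R_e^2 + R_s^2 - (\beta_{e,i} - \beta_{s,i},\; r_{e,i} - r_{s,i})}{2} + \sqrt{(1 - R_e^2)(1 - R_s^2)} \qquad [27a]$$

where

$$(\beta_{e,i} - \beta_{s,i},\; r_{e,i} - r_{s,i}) = \sum_i (\beta_{e,i} - \beta_{s,i})(r_{e,i} - r_{s,i});$$

however, by Schwarz's inequality

$$|(\beta_{e,i} - \beta_{s,i},\; r_{e,i} - r_{s,i})| \le d_\beta d_r$$

and the bounds for $r_a$ are:

$$\frac{R_e^2 + R_s^2 - d_\beta d_r}{2} - \sqrt{(1 - R_e^2)(1 - R_s^2)} \le r_a \le \frac{R_e^2 + R_s^2 + d_\beta d_r}{2} + \sqrt{(1 - R_e^2)(1 - R_s^2)}$$

If

$$(\beta_{e,i} - \beta_{s,i},\; r_{e,i} - r_{s,i}) = 0, \text{ then } R_s = R_e = R$$

and

$$r_a = R_e^2 + C(1 - R_e^2).$$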


Recent work by Newton (1964) at the Behavior Research Laboratory suggests
that subjects can.

1. (a) If $C = 0$, and $d_\beta d_r = 0$, then

$$r_a = R_e^2.$$

(b) If $C = 0$, and $d_\beta d_r \neq 0$, then

$$r_a = \frac{R_e^2 + R_s^2 - \sum (\beta_{e,i} - \beta_{s,i})(r_{e,i} - r_{s,i})}{2}$$

and the bounds for $r_a$ are:

$$\frac{R_e^2 + R_s^2 - d_\beta d_r}{2} \le r_a \le \frac{R_e^2 + R_s^2 + d_\beta d_r}{2} \qquad [28]$$

2. (a) If $C \neq 0$, and $d_\beta d_r = 0$, then the limits of $r_a$ are
(as in Case I, Formula 11):

$$2R^2 - 1 \le r_a \le 1.$$

(b) If $C \neq 0$, and $d_\beta d_r \neq 0$, then since $-1 \le C \le 1$
the bounds of $r_a$ are:

$$\frac{R_e^2 + R_s^2 - d_\beta d_r}{2} - \sqrt{(1 - R_e^2)(1 - R_s^2)} \le r_a \le \frac{R_e^2 + R_s^2 + d_\beta d_r}{2} + \sqrt{(1 - R_e^2)(1 - R_s^2)}$$

This case is the same as Case V except that the cues are correlated; therefore, 
the beta weight is introduced into the consideration of differences between the 


The difficulty of obtaining the variable C by empirical methods may be over- 
come by manipulation of the algebraic relationships already presented, as 


follows: 
Since (from Formula 27)

$$r_a = \frac{R_e^2 + R_s^2 - \sum (\beta_{e,i} - \beta_{s,i})(r_{e,i} - r_{s,i})}{2} + C\sqrt{(1 - R_e^2)(1 - R_s^2)}$$

then

$$C = \left[ r_a - \frac{R_e^2 + R_s^2 - \sum (\beta_{e,i} - \beta_{s,i})(r_{e,i} - r_{s,i})}{2} \right] \frac{1}{\sqrt{(1 - R_e^2)(1 - R_s^2)}} \qquad [30]$$

This formula may be used for computations.

As an illustration of the utility of the analysis presented under Case VI,
data from a study by Todd (1954; see also Hammond, 1955) have been
analyzed. Ten clinicians made judgments of the IQ scores of 78 patients
after an examination of the patients' scores on 19 Rorschach factors. In
order to measure achievement, Wechsler-Bellevue IQ scores were obtained for
the 78 patients. Multiple regression equations were obtained for the
environment by relating the Wechsler-Bellevue scores (distal variable) to
the various scores on the Rorschach factors (cues). Multiple regression
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TABLE 5 
DATA FROM TODD (1954) SHOWING THE EFFECT OF ENVIRONMENTAL

[Table body not recoverable; rows are Clinicians 1 through 10 and Clinician 11ᵃ.]


Note. Computational example:
Clinician No. 1
$R_e^2 = .6022$
$R_s^2 = .8585$
$r_a = .316$
$\sum (\beta_{e,i} - \beta_{s,i})(r_{e,i} - r_{s,i}) = .7740$

From Formula 27a:

$$r_a = \frac{R_e^2 + R_s^2 - \sum (\beta_{e,i} - \beta_{s,i})(r_{e,i} - r_{s,i})}{2} + C\sqrt{(1 - R_e^2)(1 - R_s^2)}.$$

Since the limits of $C$ are $-1 \le C \le 1$, the limits of $r_a$ for Clinician No. 1 are:

$$\frac{.602 + .858 - .774}{2} - \sqrt{(.398)(.142)} \le r_a \le \frac{.602 + .858 - .774}{2} + \sqrt{(.398)(.142)}$$

$$.345 - .24 \le r_a \le .345 + .24$$

$$.105 \le r_a \le .585.$$

If $C = 0$, then

$$r_a = \frac{R_e^2 + R_s^2 - \sum (\beta_{e,i} - \beta_{s,i})(r_{e,i} - r_{s,i})}{2} = \frac{.602 + .858 - .774}{2} = .345.$$

To find the value of $C$, use Formula 30:

$$C = \left[ r_a - \frac{R_e^2 + R_s^2 - \sum (\beta_{e,i} - \beta_{s,i})(r_{e,i} - r_{s,i})}{2} \right] \frac{1}{\sqrt{(1 - R_e^2)(1 - R_s^2)}} = \left[ .316 - \frac{.602 + .858 - .774}{2} \right] \frac{1}{\sqrt{(.398)(.142)}} = -.12.$$

ᵃ Hypothetical clinician who matched cue utilization coefficients with ecological validity coefficients.

With df = 1, the obtained χ² = 1.89, p < .20.
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equations were obtained for each 
clinician by relating his estimations of 
IQ to the Rorschach factors. 

Table 5 presents Todd's data (for 39 patients) reviewed in terms of Case
VI. Case VI, it will be recalled, is similar to Case V except that the cues
are intercorrelated. Also, in this case $R_e^2 \neq 1.00$. The differences
between a clinician's beta weights and those for the environment are shown
as $(\beta_{e,i} - \beta_{s,i})$; the differences between a clinician's cue
utilization coefficients and the environmental validity coefficients are
shown as $(r_{e,i} - r_{s,i})$. (Because the beta weight differences were
large in these data, the limits of accuracy given by Formula 29 were too
large to be of any interest for some of the clinicians. Therefore, Formula
27a was used, which resulted in the limits shown in Table 5.)

“Clinician No. 11” is a hypothetical clinician who matched response
process-form with environmental function-form ($R_s^2 = R_e^2$) and
utilization coefficients with validity coefficients ($d_\beta d_r = 0$).
The data from Clinician No. 11 are used to show that, even if complete
matching occurs in an environment where $R_e^2 = .60$, achievement can
range from .20 to 1.00. Exactly where achievement will fall, within this
range, will be entirely dependent upon the value of $C$, the correlation
between the residual variances.
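The Case VI computations can be sketched in a few lines. This is only an illustration under stated assumptions: the function names are hypothetical, the inputs for Clinician No. 1 are the values printed in the note to Table 5, and the hypothetical Clinician No. 11 is represented by equal squared multiple correlations of .60 with a zero cross-product term. With the printed four-place inputs the results agree with the figures in the note to within about .005, which presumably reflects rounding in the printed values.

```python
import math


def case_vi_center(r2_env: float, r2_subj: float, cross: float) -> float:
    """Midpoint of Formula 27a: (R_e^2 + R_s^2 - sum of cross-products) / 2."""
    return (r2_env + r2_subj - cross) / 2.0


def residual_scale(r2_env: float, r2_subj: float) -> float:
    """sqrt((1 - R_e^2)(1 - R_s^2)), the multiplier of C in Formula 27."""
    return math.sqrt((1.0 - r2_env) * (1.0 - r2_subj))


def achievement_bounds(r2_env: float, r2_subj: float, cross: float):
    """Formula 27a: limits of r_a as C runs from -1 to 1."""
    center = case_vi_center(r2_env, r2_subj, cross)
    half_width = residual_scale(r2_env, r2_subj)
    return center - half_width, center + half_width


def residual_correlation(r_a: float, r2_env: float, r2_subj: float, cross: float) -> float:
    """Formula 30: solve Formula 27 for C given an observed r_a."""
    scale = residual_scale(r2_env, r2_subj)
    if scale == 0.0:
        raise ValueError("C is undefined when either R^2 equals 1.00")
    return (r_a - case_vi_center(r2_env, r2_subj, cross)) / scale


if __name__ == "__main__":
    # Clinician No. 1, values as printed in the note to Table 5.
    print(achievement_bounds(0.6022, 0.8585, 0.7740))
    print(residual_correlation(0.316, 0.6022, 0.8585, 0.7740))
    # Hypothetical Clinician No. 11: complete matching, R^2 = .60.
    print(achievement_bounds(0.60, 0.60, 0.0))
```

Keeping the midpoint and the residual multiplier as separate functions mirrors the text's separation of achievement into a component portion and a pattern portion weighted by $C$.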

Note that all 10 clinicians listed 
in Table 5 tended toward greater 
linearity than actually existed in the 
environment; the multiple correlation 
coefficient for each clinician is higher 
than that for the environment. This 
implies overweighting of the assump- 
tion of a linear, additive environ- 
mental function form, an overempha- 
sis which makes the upper limit of 
possible achievement for each clini- 
cian less than 1.00. 


But there is also evidence that 
assumptions aside from linearity affect 
the judgments of the clinicians, for 
there is some correlation between 
the residual variances in their judg- 
ments and the residual variance 
in the (Rorschach-Wechsler) environ- 
ment. These correlations suggest 
that the clinicians allow the pattern 
of Rorschach factors to guide their 
judgments of IQ to some extent.
Note, however, that for 7 out of 10
clinicians this correlation is negative.
Therefore, the relation between the
pattern of Rorschach factors and 
estimates of IQ tends to be the re- 
verse of what it should be. In these 
cases, the clinicians would have done 
better had they not considered the 
pattern effects of Rorschach factors 
at all. 

For example, Clinician No. 6, 
while attributing too much linearity 
to the situation (thus restricting his 
achievement) had a positive C (+.30). 
This increased his achievement. On 
the other hand, Clinician No. 8, who 
also attributed too much linearity to 
the environment had a large negative 
C (−.43) which decreased his achievement. The ideal is approached by
Clinician No. 10, whose $R_s^2$ is closest
to that in the environment, and who
has most nearly matched cue utiliza- 
tion coefficients with environmental 
validity coefficients. This gave No. 10 
the highest achievement limit of any 
of the clinicians, but his C of only .17 
kept his achievement considerably 
below its upper limit. 

In an ecology of this type, then, 
where linear relationships account for 
only little more than half of the variance
(i.e., $R_e^2 = .60$), achievement
depends partly upon perceiving the 
importance of individual cues, and 


partly on correctly perceiving the 


pattern effects of cues. Such pattern 
effects may be completely unknown 


to the experimenter, and they may 
also be inexpressible by the judge. 
However, if they are present, they 
may have a positive or negative effect 
on achievement. 

The significance of the above an- 
alysis is that it permits the investiga- 
tor to separate achievement into that 
portion due to pattern effects and that 
due to component effects.
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APPENDIX 
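PROOF OF SCHWARZ'S INEQUALITY FOR FINITE n

The inequality

$$\left( \sum_{i=1}^{n} X_i Y_i \right)^2 \le \sum_{i=1}^{n} X_i^2 \sum_{i=1}^{n} Y_i^2 \qquad (A)$$

is the same as

$$\left[ \sum (r_{e,i} - r_{s,i})(\beta_{e,i} - \beta_{s,i}) \right]^2 \le d_r^2 d_\beta^2.$$

To show that (A) is true, use induction on $n$.

First, for $n = 1$, we have

$$(X_1 Y_1)^2 \le X_1^2 Y_1^2.$$

Assume (A) for $n = k - 1$. Then

$$\left( \sum_{i=1}^{k} X_i Y_i \right)^2 = \left( \sum_{i=1}^{k-1} X_i Y_i \right)^2 + 2 \sum_{i=1}^{k-1} X_i Y_k X_k Y_i + X_k^2 Y_k^2 \le \sum_{i=1}^{k-1} X_i^2 \sum_{i=1}^{k-1} Y_i^2 + 2 \sum_{i=1}^{k-1} X_i Y_k X_k Y_i + X_k^2 Y_k^2 = (B).$$

Since $|2ab| \le a^2 + b^2$, let $a = X_i Y_k$ and $b = X_k Y_i$; then

$$2 X_i Y_k X_k Y_i \le X_i^2 Y_k^2 + Y_i^2 X_k^2.$$

Therefore

$$2 \sum_{i=1}^{k-1} X_i Y_k X_k Y_i \le \sum_{i=1}^{k-1} (X_i^2 Y_k^2 + Y_i^2 X_k^2),$$

so

$$(B) \le \sum_{i=1}^{k-1} X_i^2 \sum_{i=1}^{k-1} Y_i^2 + Y_k^2 \sum_{i=1}^{k-1} X_i^2 + X_k^2 \sum_{i=1}^{k-1} Y_i^2 + X_k^2 Y_k^2 = \sum_{i=1}^{k} X_i^2 \sum_{i=1}^{k} Y_i^2,$$

which implies

$$\left( \sum_{i=1}^{k} X_i Y_i \right)^2 \le \sum_{i=1}^{k} X_i^2 \sum_{i=1}^{k} Y_i^2.$$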
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AN INQUIRY INTO THE DOCTRINE OF 
REMOTE ASSOCIATIONS ¹


NORMAN J. SLAMECKA 


University of Vermont 


The essential validity of the doctrine of remote associations was
assessed. The findings of three original derived-list experiments
supported a hypothesis of perception of list patterning, and were
incompatible with predictions based upon remote associations. A
fourth experiment supported the hypothesis that the association
method produces its results through an artifact of serial position.
Findings based upon the method of anticipatory and perseverative
errors were explained consistently by an alternative rationale that
excluded remote associations. It was concluded that the doctrine of
remote associations is of doubtful validity, and that serial
memorization involves acquisition of the items per se, and then the
learning of their positions in the list.


In his now classic monograph on 
rote verbal learning and retention, 
Ebbinghaus (1885) established the 
concept of remote associations. His 
investigations led him to believe that 
during the memorization of a serial 
list of items, functional associative 
bonds were established not only be- 
tween immediately adjacent items but 
also directly, and at the same time, 
between items farther separated from 
each other in the list. These latter 
direct connections between noncon- 
tiguous items were designated as re- 
mote associations. They presumably 
arose automatically as part of the ordi- 
nary process of serial memorization, 
with no special effort on the part of 
the subject to bring them about. They 
were conceptualized as being formed 
both in the forward and backward di- 
rections, that is, an item in the middle 
of the list could develop a forward 
connection with the last item and a 
backward connection with one of the 
first items. As memorization of the 
series proceeded, it was thought that 


¹ This work was aided by National Science
Foundation Grant No. G-14, 721. The
author is grateful for the assistance ren- 
dered by Ernest Lemay and Muriel Whalen. 
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each and every item tended to form 
such a connection with every item in 
the list, thereby resulting in a complex 
bundle of associative interconnections 
among all the individual members of 
the series, as depicted in Figure 1.
Numbers represent the sequential positions
of the items as they occur in
the list and arrows indicate associative
bonds, with remote forward associations
lying above, and remote backward
lying below the item sequence.
Furthermore, the strength of any 
given associative bond was asserted to 
be a negative function of its degree of 
remoteness. The bond between Items 
1 and 5, above, represents a third de- 
gree of remoteness (since it spans 
three intervening items), and it would 
be of lesser strength than the bond 
from 1 to 3, a first degree of remote- 


FIG. 1. Schematic illustration of associative
bonds among the items of a
memorized serial list, according to the
concept of Ebbinghaus.
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ness (since it spans only one inter- 
vening item). Also, all backward as- 
sociations were considered to be much 
weaker than their counterparts in the 
forward direction. Thus, the remote 
backward linkage from 4 to 1, above, 
would be weaker than the remote for- 
ward linkage from 1 to 4. To sum- 
marize our description of the essential 

properties of his concept, here is a
quotation from Ebbinghaus (Woodworth
& Schlosberg, 1954):


term and several of those which follow it.
. . . The strength of these connections de-


are . . . actually formed in a reverse
as well as in a forward direction.
considerably less for the reverse connections
than for the forward ones [p. 710].

Since its inception some 77 years 
ago this formulation has remained, for 
all intents and purposes, essentially un- 
changed and has been accorded rather 
solid and widespread recognition in 
the conceptual armamentarium of the 
verbal learning theorist. Contemporary 
reference works in the experimental 
area discuss remote associations as an 
important part of the phenomena of 
serial learning (Hovland, 1951; Mc- 
Geoch & Irion, 1952; Woodworth & 
Schlosberg, 1954), and recognition of 
the concept is also given in several 
introductory psychology texts (Geld- 
ard, 1962; Kimble, 1956; Morgan, 
1961). 

After a close appraisal of the perti- 
nent literature and after conducting 
several experiments of our own, we 
have been forced to the conclusion 
that such widespread acceptance is 
definitely ill-advised, since there is 
much room for reasonable doubt about 
the validity of the remote associations 


doctrine (in our estimation it has in- 
deed become a doctrine), and that it 


serves no necessary purpose in the 
conceptualization of serial verbal 
learning. 


Experimental data from the remote 
associations literature have usually 
been regarded as proof that there in- 
deed are such functional connections 
between each item and all others in 
a serial list. However, it will be dem- 
onstrated that certain necessary de- 
ductions from the remote associations 
position are not borne out empirically, 
and that these data are susceptible of 
other equally (if not more) plausible 
interpretations. Further, it will be 
shown that phenomena previously at- 
tributed to the necessary influence of 
remote associations can be produced 
in the absence of any serial learning 
whatever. We shall present our sup- 
porting evidence in the sections to 
follow, sections delimited on the basis 
of the three types of experimental 
methods which have been utilized in 
the study of remote associations. 


METHOD OF DERIVED LISTS


This method, originated by Ebbing- 
haus, provided the sole source of the 
data which he presented as evidence 
for remote associations. An original 
list is memorized, then the same items 
are arranged in a different serial order 
and presented to the same subject 
again for learning. The latter is the 
derived list, and it usually involves a 
patterned rearrangement of the items 
with respect to their original arrange- 
ment. A first-order derived list is 
formed by skipping every other item 
from the original list, a second-order 
list skips two items at a time, and so 
on. If the sequence of appearance of
items in the original list is designated
as 1, 2, 3, 4, 5, 6, 7, 8, 9, then a
first-order derived list would have a
sequence of 1, 3, 5, 7, 9, 2, 4, 6, 8, and a
second-order list would be 1, 4, 7, 2, 5,
8, 3, 6, 9. The first-order list brings
together items which were linked by
a first degree remote association in
original learning, and the second-order
arrangement brings together items
which were previously related by a
second degree association. Keeping
in mind that remote associations of the
lowest orders are strongest, it then
follows that a first-order list should be 
learned faster than one of second- 
order, and the latter still faster than 
a scrambled or control list whose items 
are in a random sequence. Ebbing- 
haus (1885) tested such a hypothesis 
and, indeed, found it supported by his 
data in terms of percentage savings in 
learning time. 
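The construction rule for regularly patterned derived lists can be stated compactly. The sketch below is an illustration only; the function name is an assumption, and the rule it encodes (take every (order + 1)-th item, then return to the earliest unused item and repeat) is the one implied by the two example sequences given above.

```python
def derived_list(items, order):
    """Build a regularly patterned derived list.

    order = 1 skips one item at a time (first-order list),
    order = 2 skips two items at a time (second-order list), and so on.
    After reaching the end of the original list, the pass restarts at the
    earliest item not yet used, so that every item appears exactly once.
    """
    if order < 0:
        raise ValueError("order must be non-negative")
    step = order + 1
    result = []
    for start in range(step):
        result.extend(items[start::step])
    return result


if __name__ == "__main__":
    original = list(range(1, 10))
    print(derived_list(original, 1))  # first-order arrangement
    print(derived_list(original, 2))  # second-order arrangement
```

An order of 0 returns the original list unchanged, which is a convenient sanity check on the rule.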
We cannot agree, however, that his 
experiment proved the existence of 
remote associations, for at least three 
reasons. First, Ebbinghaus learned 
by the method of whole presentation, 
wherein the entire list was always in 
view. It is possible that, inadvertently, 
he formed “remote” associations di- 
rectly, by attending to nonadjacent 
items in direct succession. Secondly, 
he read the lists in a regular rhyth- 
mical manner, and such an imposed 
organization itself probably influenced 
the ease with which certain associa- 
tions were formed, as shown by 
Müller and Schumann (Woodworth 
& Schlosberg, 1954). Third, and per- 
haps the more important objection, 
Ebbinghaus was his own subject. 
Since he prepared his own lists and 
was fully aware of the principle of 
their construction, it is quite possible
that such knowledge could have
produced the results, in spite of what- 
ever other precautions he did take. 
We turn, then, from this pioneering 
work to the subsequent experimental 
literature on remote associations using 
the method of derived lists, and find
relatively few references.² Cason
(1926) found no significant facilita- 
tion in the learning of first-order lists 
and concluded that “when a series of 
items is well learned and practiced 
until the recitation is very smooth, the 
forward associations between the 
adjacent items are the only associa- 
tions that are formed [p. 316].” But 
we are left with some misgivings since 
the materials used were very long 
prose passages and a poem, and it is 
possible that the sequential depend- 
encies and contextual constraints found 
in such material served to mask any 
effects of remote associations. A study 
by Hall (1928) must be ruled out of 
serious consideration because her pe- 
culiar method of constructing derived 
lists precluded any meaningful con- 
clusions about remote associations. 
(McGeoch and Irion [1952] are also 
of this general opinion.) By far the 
best designed and controlled study 
was carried out by Lepley (1934), 
who used first-order lists and com- 
parable control lists of 12 nonsense 
syllables at six different retention in- 
tervals. Unfortunately, in no case was 
any first-order list easier to learn than 
its original list, and the same applied 
to a majority of the control lists as 
well. Such heavy and consistent 
negative transfer, resulting in a find- 
ing of no savings, renders the mean- 
ing of the results inconclusive for our 
purposes. Lepley (1934) himself 
wrote: “Apparently, the potent factors 
at work in the present study are fac- 
tors of interference of some sort and 
the only evidence of remote excitation 
we have results from a lesser degree 
of interference in the test lists at the 


2 Since we are concerned only with de- 
rived-list studies in which remote associa- 
tions were sought, we are not including a 
discussion of two studies in which the focus 
was upon adjacent backward associations, 
even though derived lists were used. 
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end of the thirty minute forgetting 
interval [p. 16].” Such words are
hardly calculated to convince the dis- 
interested reader of the reality of re- 
mote associations. 

Goldstein (1950), with relatively 
naive subjects, reported substantial 
savings over the original list in the 
learning of both a first-order list and 
a control list, but failed to find any 
savings advantage of the former over 
the latter arrangement, contrary to 
the results of Ebbinghaus. Apparently, 
degree of acquaintance with the ma- 
terials used is a factor in the outcomes 
to be expected. 

We will now describe our own ex- 
perimental efforts to clarify the ques- 
tion of the meaning of derived list 
data. One common characteristic of 
all materials used in previous studies 
is that, with the exception of the con- 
trol condition, there was always a 
rigidly patterned arrangement of items 
in the derived lists. That is, the 
ordering of the items in a derived 
list with respect to their original order 
was always regular or systematic. It 
is therefore conceivable that the sub- 
ject, given a derived list, might actu- 
ally perceive the orderly relation that 
it bore to the original (such as skip- 
ping every other item), and proceed to 
“learn” it by (for instance) a process 
of mediation. This might take the 
form of uttering aloud every other 
item, and bridging the gaps by silent 
utterance of the intervening items. If 
his retention of the original list was 
perfect he could “learn” the derived 
list on the first recitation trial and 

earn an impressive savings score. If
such were the case, his behavior cer- 

tainly would not constitute proof of 
remote associative bonds, but only of 
perceptual alertness. Degrees of remoteness
beyond the first would be
progressively more difficult to perceive,
and thus progressively fewer
savings would be the inevitable result.
Certain statements from previous
studies are provocative in this
regard. Cason (1926), who found no
savings, wrote: “When the third and
last part of the procedure had been
completed, the subject was asked a
number of questions about the experiment.
None of the subjects noticed
the way the lists had been constructed
. . . [p. 309].” On the other hand
in a report where substantial savings 
were obtained in a study of adjacent 
backward associations, Garret and 
Hartman (1926) commented: “When 
the reversed lists appeared, every S
immediately guessed the order of the 
new arrangement, while the order of 
the reversed-pairs was at once evi- 
dent to three Ss... [p. 245].” 

Clearly, what is needed is some way
of minimizing the possibility that the
subject will have such an insight,
while at the same time permitting
savings due to truly remote associations.
There is nothing whatever in the
general rationale of the derived list method
that requires the use of a regularly
patterned list, just so long as it allows
remote associations to demonstrate
their facilitating effects upon learning.
Therefore, a modified type of derived
list which is of the first-order on the
average, should produce just as much
savings as the regularly arranged first-
order list, but without allowing the
subject to benefit from perception of
patterning. But if perception of patterning
is the critical variable, then
the modified first-order list should be
learned just as slowly as a control
list.³ Experiment I was designed to
test these divergent hypotheses.


-solvi profit- 
as identification of temporarily 
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TABLE 1 


MEAN TRIALS FOR OL AND DERIVED LISTS FOR EXPERIMENTS I, II, AND III

             Patterned list             Modified list              Control list
             OL           Derived       OL           Derived       OL           Derived
Experiment   M      SD    M      SD     M      SD    M      SD     M      SD    M      SD

I             1.06   .23   4.94  3.22    1.06   .23   7.61  3.59    1.06   .23   7.94  2.51
II           20.45  8.22  13.30  4.40   19.95  8.01  12.95  7.31   20.60  6.27  13.55  5.17
III          19.90  4.42  10.90  4.71   20.25  7.32  12.05  3.28   20.80  4.81  14.10  5.02


EXPERIMENT I 


A total of 36 students served as 
subjects as part of the general psy- 
chology course requirements. Each 
subject learned one original list and 
then three derived lists in immediate 
succession, all to a criterion of one 
perfect recitation. The order of the 
derived lists was counterbalanced 
across subjects. Learning was by 
serial anticipation on a memory drum 
with a 3-second rate of presentation 
and a 6-second intertrial interval. The 
original list items were 14 letters of 
the alphabet in alphabetical sequence, 
from A to N for half the subjects, 
and from M to Z for the other half. 
The three derived lists were: (a) a 
control list with items in haphazard 
order, (b) the usual patterned first- 
order list with every other item 
skipped, and (c) a modified first-order 
list with 0, 1, and 2 items skipped in 
an irregular fashion, but with an over- 
all mean of one skipped item. For the 
A to N list, it looked like this: A, D, 
E, G, H, K, M, B, C, F, I, J, L, N,
corresponding to a skip sequence of
2, 0, 1, 0, 2, 1, (2), 0, 2, 2, 0, 1, 1.
The necessary gap from M to B 
(indicated by the number in paren- 
theses) also occurred in the regularly 
patterned arrangement, and is a com- 


or spatially extended patterns . . . [p.
209].” Also, the well-known “discrimination
hypothesis” is based upon the
organism’s perception of regular sequences of
events (Tyler, Wortz, & Bitterman, 1953).


mon feature of derived list construc- 
tion, in order that all the items be 
used. All lists retained the same first 
and last items of the original arrange- 
ment. The results presented in terms 
of mean total acquisition trials are 
found in the top row of Table 1. 
Original learning (OL) was 1.06 trials 
because, oddly enough, two subjects 
needed two trials each to recite the 
alphabet, in spite of the fact that all 
subjects were told the initial letter and 
that there was an alphabetical se- 
quence. The overall F of 10.43 was 
significant beyond the .01 point (df= 
2/35), and Tukey's test showed that 
the patterned list was more rapidly 
learned than either the modified first- 
order, or the control list (.01 level 
gap = 1.86), with no difference be- 
tween the latter two lists. Further- 
more, 10 subjects learned the pat- 
terned list on the very first recitation 
trial, but this did not occur even once 
for the other two lists! It seems un- 
deniable that such extremely rapid 
acquisition must have been the result 
of perception of the pattern. We feel 
that these data definitely provide 
strong support for the perception of 
patterning hypothesis, and at the same 
time stand against the remote associa- 
tions doctrine, since the latter would 
maintain that the modified list should 
have produced savings equivalent to 
those of the patterned list. 
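The skip sequence of a derived list can be recovered mechanically from the two arrangements. The sketch below is an illustration, not the author's procedure: the function names are hypothetical, and the treatment of the one backward step (counting the skipped items by wrapping around the end of the original list) is an assumption made so that every step yields a non-negative count.

```python
def skip_sequence(original, derived):
    """Number of original-list items skipped between successive derived items.

    A backward step (such as M to B) is counted by wrapping around the end
    of the original list; this wrap-around convention is an assumption.
    """
    position = {item: index for index, item in enumerate(original)}
    n = len(original)
    return [
        (position[nxt] - position[cur] - 1) % n
        for cur, nxt in zip(derived, derived[1:])
    ]


def mean_forward_skip(original, derived):
    """Mean number of skipped items over forward steps only."""
    position = {item: index for index, item in enumerate(original)}
    forward = [
        position[nxt] - position[cur] - 1
        for cur, nxt in zip(derived, derived[1:])
        if position[nxt] > position[cur]
    ]
    return sum(forward) / len(forward)


if __name__ == "__main__":
    original = "ABCDEFGHIJKLMN"
    modified = "ADEGHKMBCFIJLN"
    print(skip_sequence(original, modified))
    print(mean_forward_skip(original, modified))
```

Applied to the modified A to N list, the forward steps average one skipped item, which is the sense in which that list is first-order on the average.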

The fact that savings were not con- 
sistently obtained in all past studies 
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may be attributed to the subject's fail- 
ure to perceive the particular derived 
list patterning. The familiar alphabet 
series used for OL in Experiment I 
was deliberately chosen in order that 
it would facilitate rapid perception of 
the derived list pattern. On the other 
hand, we would predict that an origi- 
nal list of relatively unfamiliar items, 
such as nonsense syllables, would not 
produce facilitation in either type of 
first-order derived list. Although Eb- 
binghaus used nonsense syllable lists 
and reported savings in accord with 
remote association expectations, it can 
hardly be said that, for him, those 
materials were unfamiliar, since he in- 
vented them and was fully cognizant 
of their patterning. A truly naive sub- 
ject would less likely have insight into 
the systematic rearrangement used for 
the patterned list, and consequently 
would show no savings over the control
or modified lists. The remote
associations position would predict that
since the original list was learned, 
remote associations must have been 
formed, and subsequent savings of 
equal degree should be obtained for 
both the patterned and modified lists, 
over the control. Experiment II tested 
these hypotheses. 


EXPERIMENT II


A total of 60 naive subjects drawn 
from the same source as before was 
divided into three groups of 20 sub- 
jects each. Each subject learned a 
single original list to a criterion of 
two successive correct recitations, fol- 
lowed immediately by acquisition of 
a single derived list to the same cri- 
terion. The same presentation rate 
and intertrial interval were used as 
before. The lists were made up of 
12 nonsense syllables with m’ values 
from 3.03 to 3.12 (Noble, 1961). Two 
forms of the original list were made up, 
using the same syllables but in dif- 
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ferent order, to lessen inadvertent se- 
quence effects. The three derived 
lists were of the same types as those 
used before: (a) a control list, (b) 
a patterned first-order list, and (c) 
a modified first-order (on the aver- 
age) list with the following sequence 
of skipped syllables: 2, 0, 1, 2, 0, (2),
0, 2, 1, 0, 2. Prior to exposure to the
derived list, each subject was told 
that it consisted of the same items as 
those in the original list, but in a dif- 
ferent order. (The subjects of Experi- 
ment I were also told this.) Results 
of Experiment II are shown in the 
second row of Table 1. It requires 
no statistical analysis to conclude that 
there were no significant differences 
in the rates of learning among the 
three derived lists. No subject gave 
any evidence of having detected the 
principle of arrangement of List b. 
Goldstein (1950) also found that a 
patterned list was learned no more 
rapidly than a control list. 

It is clear that simply arranging a 
derived list in patterned low-order 
form for learning by a naive subject 
does not automatically produce more 
rapid acquisition in comparison to the 
random or control arrangement. This 
fact is in contradiction to the specific 
expectation from the doctrine of re- 
mote associations, as is the finding 
of no savings for the modified list, as 
well. Some knowledge of the pattern- 
ing must be present, it would seem, 
before facilitation can reasonably be 
expected. 

What would the outcome of Ex- 
periment II have been if each subject 
did have knowledge of the principle 
of construction of his second list? We 
would predict that such knowledge 
would most benefit the group having 
the regularly patterned list, would
least benefit the control group (knowing
that the list is a haphazard rearrangement
does not reduce much
uncertainty), and might conceivably
benefit the modified list group to some
extent (knowing that the next item
is either 0, 1, or 2 items away in the 
original list should reduce some un- 
certainty). Experiment III was per- 
formed to answer this question. 


EXPERIMENT III 


Another 60 naive subjects were 
used from the same source as before. 
Essentially, Experiment II above was 
repeated, with the identical nonsense 
syllable lists, the three types of de- 
rived lists, and the same presentation 
conditions and acquisition criteria. 
But these subjects were given full 
knowledge of the nature of their re- 
spective derived lists. This was 
achieved by: numbering each syllable 
on the original list in consecutive order 
from 1 to 12 and retaining these same 
numbers when the list appeared in de- 
rived form, and informing the subject, 
just before presentation of the second 
list, of the precise principle underlying 
its rearrangement. Results are shown 
in the bottom row of Table 1. Dun- 
can's Multiple Range Test indicated 
significantly faster learning of the pat- 
terned list than of the control list 
(Rp3 = 3.02, df = 57, p < .05). The
modified list occupied an intermediate 
position in mean trials but was not 
significantly different from either of 
the other two lists (Rp2 = 2.86, df = 
57, p > .05). Thus, with knowledge 
of the arrangement principle available 
to them, the subjects did best on the 
traditional Ebbinghaus type of list. 
Without such knowledge, as shown 
in Experiment II, they did no better 
than control subjects, favorable “re- 
mote” associations notwithstanding. 
Such a distinction would be foreign 
to the remote associations position, 
since it views formation of remote 
bonds as an automatic accompaniment 
of serial learning. 


For purposes of control it was 
necessary to check that the three types 
of derived lists used in Experiments 
II and III were of equal difficulty 
in and of themselves, apart from any 
transfer effects from the original learn- 
ing. Thirty naive subjects, from the 
same source as before, were each given 
one of the three list types to learn to a 
criterion of one errorless trial. Mean 
trials and subjects SD's for the pat- 
terned, modified, and control lists, re- 
spectively, were: 18.3 (5.14), 20.1 
(5.90), and 19.2 (5.81). The F was 
.23 (df = 2/27), certainly not significant.
It can be concluded that the
derived lists per se did not differ in 
difficulty. 
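
The size of this F ratio can be checked from the reported means and SDs alone. The sketch below assumes 10 subjects per list (30 subjects divided equally, which the text implies but does not state) and shows the result under both conventions for the SD, since the report does not say whether n or n - 1 was used as the divisor. The function name is illustrative.

```python
def f_from_summary(means, sds, n_per_group, sd_uses_n=False):
    """One-way ANOVA F ratio from group means and SDs (equal group sizes).

    sd_uses_n=True treats each SD as computed with divisor n;
    otherwise divisor n - 1 is assumed.
    """
    k = len(means)
    grand_mean = sum(means) / k  # valid because group sizes are equal
    ss_between = n_per_group * sum((m - grand_mean) ** 2 for m in means)
    df_between = k - 1
    df_within = k * (n_per_group - 1)
    multiplier = n_per_group if sd_uses_n else n_per_group - 1
    ss_within = multiplier * sum(sd ** 2 for sd in sds)
    return (ss_between / df_between) / (ss_within / df_within)


means = [18.3, 20.1, 19.2]   # patterned, modified, control
sds = [5.14, 5.90, 5.81]

f_divisor_n = f_from_summary(means, sds, 10, sd_uses_n=True)
f_divisor_n_minus_1 = f_from_summary(means, sds, 10, sd_uses_n=False)
print(round(f_divisor_n, 2), round(f_divisor_n_minus_1, 2))
```

Either convention gives an F well below 1 on 2 and 27 degrees of freedom, in line with the conclusion that the lists did not differ in difficulty.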

We conclude from the above experi- 
ments that the derived list method 
in fact fails to provide convincing 
support for the doctrine of remote as- 
sociation, and instead, serves to give 
plausibility to an alternative account 
based upon the subject’s perception of 
list patterning. We will next con- 
sider the second method of studying 
remote associations. 


ASSOCIATION METHOD 


This method was introduced by 
Wohlgemuth (McGeoch & Irion, 
1952) and later elaborated by others. 
After having learned a serial list, the 
subject is presented with each item 
individually and required to react with 
the first response that occurs to him. 
In order to reduce extraneous re- 
sponses the subject may be instructed 
to restrict his associations by respond- 
ing only with a list item. In any 
event, the production of an item re- 
sponse which was not adjacent to the 
stimulus item in OL is taken prima 
facie as representing a remote associa- 
tion established through serial learn- 
ing. If Item 2 is the stimulus and 
the subject responds with Item 6, this 
is construed as a third degree remote
forward association. McGeoch and
Irion (1952) appear to have no doubts 
about the essential superiority and 
validity of this technique when they 
assert that: 

Results obtained by the association method 
provide a more direct picture of the specific 
remote associations formed, are freer than 
the method of derived lists from the pos- 
sible masking influences of associative in- 
hibition, and leave no doubt that forward 
remote associations occur in relatively large 
numbers [p. 96].
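
The scoring rule of the association method described above (Item 2 as stimulus and Item 6 as response counting as a third degree forward association) can be sketched as follows. The function name and return format are illustrative and not part of the original procedure.

```python
def classify_association(stimulus, response):
    """Score one response of the association method.

    Items are identified by their serial positions in the original list.
    The degree of remoteness is the number of list items lying between the
    stimulus and the response, so an adjacent response has degree 0.
    """
    if response == stimulus:
        raise ValueError("the stimulus item itself is not a scorable response")
    gap = response - stimulus
    direction = "forward" if gap > 0 else "backward"
    degree = abs(gap) - 1
    return direction, degree


print(classify_association(2, 6))  # the example given in the text
print(classify_association(2, 3))  # an adjacent forward response
print(classify_association(5, 2))  # a backward response
```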


McGeoch (1936) conducted an ex- 
periment using this technique, where, 
in addition to the associations, he also 
measured the reaction times of each. 
His results showed that there was a 
fairly steady decrease in the absolute 
number of associations (both back- 
ward and forward) as their degree of 
remoteness increased. Such a finding 
is in accord with expectations, since 
more remote associations are presumed 
to be weaker than less remote ones, 
and consequently less probable of 
emission. However, the forward re- 
action latencies themselves clearly con- 
tradicted such a position, since they 
tended to have shorter durations as 
they became more remote. Thus there 
is a nagging inconsistency in the find- 
ings as they relate to the doctrine of 
remote associations. 

Later, Raskin and Cook (1937) 
added a logical refinement to the analy- 
sis of data generated by this method, 
and their paper appears to be the 
last significant word on the develop- 
ment of the technique. They pointed 
out that it is improper to tabulate only 
the absolute number of remote associ- 
ations given, since the opportunities 
for their occurrence necessarily lessen 
as the degree of remoteness increases.
A correction for opportunity must be
made on the absolute totals. In their
experiment they found, in agreement 
with McGeoch, that the absolute totals
showed a steady decline with increasing
degrees of remoteness, but when
corrected for opportunity there was a 
rise starting at the fourth degree, re- 
sulting in an overall U-shaped func- 
tion. They attributed this rise to 
summative effects of both backward 
and forward remote associations im- 
pinging upon the same item. They 
concluded that: 


remote forward associations are formed in 
the learning of a series of nonsense syllables, 
but . . . backward associations, also, are
formed during the learning of such a series. 
As in the case of forward associations, the 
associations in a backward direction are 
formed between nonadjacent as well as 
adjacent syllables . . . [p. 393].


We vigorously dispute this conclu- 
sion and challenge the validity of the 
technique on which it was based, on the 
grounds that the association method 
involved a serious artifact which 
renders the results at least ambiguous 
with respect to the interpretation to 
be placed upon them. This flaw consists 
in completely neglecting the decisive 
implications of the serial position func- 
tion. It is a well-established empirical 
fact that the initial items of a list are 
most easily learned, the last items next 
most easily, and those just beyond the 
middle are the last to be mastered. 
Since serial learning proceeds by com- 
plete trials, wherein the entire list is 
always run through in succession, it 
follows that the subject will have dif- 
ferential amounts of practice in giving 
the correct responses to different items. 
That is, by the time the list is mastered 
the subject will have had considerable 
overlearning on the first items, less so 
on the last items, and only bare mastery 
of the middle ones. Now, generalizing
from the important work of such
authors as Goldiamond and Hawkins 
(1958) and Underwood and Schultz 
(1960, Ch. 6), a relationship between 
the prior frequency of a response and
its degree of availability may be stated
as follows. Prefamiliarization of a
response, or the degree to which it
has been previously emitted, determines
the probability of its emission
in a subsequent free-responding situation.
We suggest that the association
method embodies the essential aspects
of such a prefamiliarization procedure,
in that the subject gets differential
practice on the various OL items (because
of their different serial positions),
and is later tested for the
relative dominance or availability of
his responses. Such a state of affairs
should logically, in and of itself, pro- 
duce the pattern of results commonly 
reported with the use of the associa- 
tion method, and heretofore singly at- 
tributed to the operation of true remote 
associative bonds. We contend that 
results such as those of McGeoch 
(1936) and Raskin and Cook (1937)
could be obtained with no serial learn- 
ing entering into the picture whatever, 
and that therefore they would not bear 
critically upon the question of whether 
remote associations are a valid by- 
product of serial learning. Experi- 
ment IV tested this hypothesis. 


EXPERIMENT IV


Fifty-four naive psychology students 
participated as subjects. Each subject 
was given a controlled amount of differential
practice on six nonsense syllables.
The syllables, ranging in m
value (Noble, 1961) from 3.51 to
3.53, were: CIV, LAT, KEN, DIS, MOR,
HAL. Each syllable was typed on a
separate card, and after the deck of 
cards was well shuffled, the subject 
went through it, looking carefully at
each card, spelling the syllable aloud
and then pronouncing it. The fre- 
quencies of appearance in the deck of 
the six syllables were 25, 10, 3, 1, 3, 
10. This was an approximation to 
the general shape of a serial position
curve, and the subject went through
this deck of 52 cards just once. There 
were three different arrangements of 
syllable-frequency pairings in order to 
control for idiosyncratic properties of 
any given syllable. Upon completion 
of this differential practice, the subject
was tested by the association technique.
In order to avoid the emission
of unusable, extraneous responses, the 
subject was asked to respond with the 
first syllable that came to mind from 
those he had seen, but not to repeat 
the stimulus term he was being given. 
He was presented with each of the 
six syllables in haphazard order, one
time each. The results are displayed 
in Table 2. 
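
The practice deck described above can be sketched in a few lines. The pairing of syllables with frequencies in the order listed is an assumption made only for illustration; the text reports that three different syllable-frequency pairings were used but does not specify them. The helper name `build_deck` and the fixed random seed are likewise illustrative.

```python
import random
from collections import Counter

SYLLABLES = ["CIV", "LAT", "KEN", "DIS", "MOR", "HAL"]
# Practice frequencies approximating a serial position curve (from the text).
FREQUENCIES = [25, 10, 3, 1, 3, 10]


def build_deck(syllables, frequencies, rng):
    """Return a shuffled deck with each syllable repeated by its frequency."""
    deck = [s for s, f in zip(syllables, frequencies) for _ in range(f)]
    rng.shuffle(deck)
    return deck


deck = build_deck(SYLLABLES, FREQUENCIES, random.Random(0))
counts = Counter(deck)
print(len(deck))      # number of cards the subject goes through once
print(dict(counts))   # practice received by each syllable
```

The least practiced item falls just beyond the middle of the sequence, mirroring the region of a serial list that is last to be mastered.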

We may regard the items across the 
top margin of Table 2 as representing 
the sequence of items in a serial list 
which had, respectively, 25, 10, 3, 1, 
3, and 10 mean total correct anticipa- 
tions in OL. Along the left-hand 
margin are the stimuli presented dur- 
ing the association test, and the cells 
contain the total frequencies of par- 
ticular responses given as associates. 
One diagonal row of cells is blank, 
since instructions forbade responding 
with the given stimulus syllable. 
Entries in the top row were not ob- 
tained empirically but were derived 
from the marginal totals of 54, divided 
according to the proportions required
by the obtained column totals and are
not included in the grand totals. The 
(X) represents what would have been 
the starting symbol for a serial list, that 
tells the subject to anticipate item 
number one. Since we lacked the 
foresight to include such an item dur- 
ing the association test, we decided, 
for completeness’ sake, to synthesize 
its effect in the controlled manner de- 
scribed in order to provide one more 
level of forward remoteness for the 
presentation in Table 3. 

TABLE 2

TOTAL FREQUENCIES OF ASSOCIATIONS OBTAINED IN EXPERIMENT IV

                      Response items (times practiced)
Stimulus     1       2       3       4       5       6      Total
            (25)    (10)    (3)     (1)     (3)     (10)
   X         15      11     ...     ...     ...       9
   1                 21     ...     ...     ...      14       54
   2         24               7     ...     ...      11       54
   3         14       9               6     ...      11       54
   4         13      14      10               9       8       54
   5         26       8     ...       5               9       54
   6         17      13     ...     ...       6               54
 Total       94      65     ...     ...     ...      53      324

The frequencies within any column
in Table 2 display some degree of unaccounted-for
variability, but an inspection
of the overall column totals
shows that the attempt to achieve a 
serial position function through ma- 
nipulation of the practice frequencies 
alone was reasonably successful. By
the same token, our original statement 
relating degree of prefamiliarization to 
the subsequent degree of response 
availability was confirmed. Table 3 
shows the distribution of these same re- 
mote associations arranged in degree of 
remoteness, according to the custom- 
ary analysis of association method 
data. For instance, to obtain the total 
immediate forward responses (zero degree
of remoteness), the frequencies
in Table 2 running from the upper
left corner down to the right through
15, 21, 7, 6, 9, and 9 were summed
and appear as an absolute total of 67
in Table 3. To obtain the immediate
backward responses, the frequencies
running from the lower right corner
of Table 2 up to the left through 6,
5, 10, 9, and 24 were summed and
appear as a total of 54 in Table 3.
The corrections suggested by Raskin
and Cook, and the resulting totals,
are also presented. The “absolute
totals” columns show steady decreases
in the totals of remote associations as
degree of remoteness increases, in
good agreement with McGeoch (1936)
and Raskin and Cook (1937). The
“corrected totals” columns also tend
to approximate the U-shaped function
described by Raskin and Cook.


TABLE 3

ABSOLUTE TOTALS AND CORRECTED TOTALS OF ASSOCIATIONS IN EXPERIMENT IV,
ACCORDING TO DEGREES OF REMOTENESS

                    Forward associations
Degree of      Absolute                   Corrected
remoteness      total      Correction       total
    0             67          ...            ...
   ...           ...          ...            ...

                    Backward associations
Degree of      Absolute                   Corrected
remoteness      total      Correction       total
    0             54          5/5             54
    1             40          5/4             50
    2             33          5/3             55
    3             39          5/2             97
    4             17          5/1             85
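
The correction for opportunity can be illustrated with the backward totals reported for Experiment IV (54, 40, 33, 39, and 17, with correction factors 5/5 through 5/1). The sketch assumes, as those factors suggest, that each absolute total is multiplied by the ratio of the opportunities available at zero degree to the opportunities available at the given degree. The function name is illustrative.

```python
from fractions import Fraction


def correct_for_opportunity(absolute_totals, max_opportunities):
    """Scale each total by max_opportunities / opportunities at that degree.

    With six list items there are five stimulus items that allow an
    immediate backward response, four that allow a first degree backward
    response, and so on down to one.
    """
    corrected = []
    for degree, total in enumerate(absolute_totals):
        opportunities = max_opportunities - degree
        corrected.append(total * Fraction(max_opportunities, opportunities))
    return corrected


backward_absolute = [54, 40, 33, 39, 17]
backward_corrected = correct_for_opportunity(backward_absolute, 5)
print([float(value) for value in backward_corrected])
```

The computed values agree with the printed corrected totals except at the third degree, where the exact product is 97.5 and the table gives 97. The rise at the higher degrees is what produces the U-shaped function.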

Since it has now been shown that 
the overall results of the association 
method can be brought about without 
any previous serial learning, it follows 
that data gathered by that method are 
not at all conclusive with regard to 
the question of remote associations, and
constitute no clear proof of their ex- 
istence. Although the responses ob- 
tained in Experiment IV may be said 
to be associations to the stimulus items, 
they are certainly not the result of 
any serial learning procedure, and are
not, therefore, remote associations in 
the true meaning of the term. 

The last of the three methods used 
in the study of remote associations is 
discussed below. 


METHOD OF ANTICIPATORY AND
PERSEVERATIVE ERRORS


When one attempts to recite a serial 
list from memory, he may omit a 
few items and give a response further 
ahead in the sequence, or he may repeat
an item that occurred earlier in
the sequence. These responses are 
customarily designated, respectively, 
as anticipatory and perseverative errors.
The method of anticipatory and
perseverative errors consists essen- 
tially in analyzing the pattern of such 
errors committed during serial 
memorization. It has been presumed, 
rather uncritically, that such errors 
represent genuine remote associations
formed by serial learning. The apparent
advantages of this method, according
to McGeoch and Irion
(1952), are:


The anticipatory error method has certain
advantages over the two other methods.
The remote associations, whatever their degree
of remoteness, are overt and specific,
which is not the case under the method
of derived lists, though it is by the association
method. Remote associations may
be observed as a function of amount of 
practice in a single group of subjects. This 
cannot be done with the method of derived 
lists, and can be done with the association 
method only if an association test is made 
after each practice trial or after selected 
blocks of trials [p. 98]. 


Studies by Lumley (1932), Mitchell 
(1934), and Bugelski (1950) are rep- 
resentative of the use of this method, 
and their major findings may be sum- 
marized as follows: (a) anticipatory 
errors are much more frequent than 
perseverative errors, (b) frequency of 
such errors is inversely related to their 
degree of remoteness, even when cor- 
rected for opportunity of occurrence, 
and (c) as learning progresses, the 
number of far remote associations declines
both absolutely and in proportion
to the number of near remote
associations.

In order to verify such findings from 
some of our own data, we present in 
Table 4 an analysis of the OL per- 
formance, up to one perfect trial, of the 
60 subjects from Experiment II, above. 
Our data do indeed show the same 
pattern as those of previous reports: 
(a) out of a total of 1077 remote 
errors, only 62, or 5.8%, are in the 
backward direction; (b) there are 
fewer errors the higher their degree, 
even after correction; and (c) as 
fourths of learning increase, far re- 
mote errors drop off faster than near 
remotes. There is even a tendency, 
as Lumley (1932) observed, for near 
remotes to increase somewhat, at least 
up to the second fourth of learning. 
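
The three points can be checked directly against the entries of Table 4. In the sketch below the raw frequencies and the count of 62 backward errors are taken from the text; the correction factor of 11/(12 - d) for degree d is an assumption inferred from the ratio of the printed corrected totals to the raw totals, since the text does not state the formula. Variable names are illustrative.

```python
# Remote errors by fourth of learning (rows) and degree 1..9 (columns).
TABLE_4 = {
    "1st": [110, 67, 40, 31, 18, 11, 7, 9, 3],
    "2nd": [188, 86, 26, 24, 11, 2, 5, 2, 1],
    "3rd": [160, 61, 15, 14, 5, 1, 1, 2, 2],
    "4th": [128, 27, 14, 4, 1, 1, 0, 0, 0],
}
LIST_LENGTH = 12
PRINTED_CORRECTED = [586, 265, 116, 100, 55, 28, 29, 36, 22]

row_totals = {fourth: sum(row) for fourth, row in TABLE_4.items()}
column_totals = [sum(column) for column in zip(*TABLE_4.values())]
grand_total = sum(column_totals)

# (a) Share of errors in the backward direction (62 reported in the text).
backward_percent = round(100 * 62 / grand_total, 1)

# (b) Totals corrected for opportunity (assumed factor, see the lead-in).
corrected = [
    total * (LIST_LENGTH - 1) / (LIST_LENGTH - degree)
    for degree, total in enumerate(column_totals, start=1)
]

# (c) Proportion of errors of third degree or higher in each fourth.
far_share = {fourth: sum(row[2:]) / sum(row) for fourth, row in TABLE_4.items()}

print(row_totals, grand_total, backward_percent)
print([round(value, 1) for value in corrected])
print({fourth: round(share, 3) for fourth, share in far_share.items()})
```

The cutoff between "near" and "far" remote errors at the third degree is an arbitrary choice for the illustration; the decline in the far share across fourths does not depend on it in these data.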

TABLE 4

TOTAL NUMBER OF REMOTE ASSOCIATIONS (FORWARD AND BACKWARD COMBINED)
ACCORDING TO DEGREE OF REMOTENESS FOR EACH FOURTH OF LEARNING,
FROM OL OF EXPERIMENT II

Fourths of                  Degree of remoteness
learning       1     2     3     4     5     6     7     8     9    Total
1st          110    67    40    31    18    11     7     9     3     296
2nd          188    86    26    24    11     2     5     2     1     345
3rd          160    61    15    14     5     1     1     2     2     261
4th          128    27    14     4     1     1     0     0     0     175
Total        586   241    95    73    35    15    13    13     6    1077
Corrected
total        586   265   116   100    55    28    29    36    22     ...

However, as may by now be suspected,
our view of the meaning of
data such as these is quite different
from the usual interpretations that
have been put upon them in the past.
We choose, in order to test the
strength of the traditional doctrine, to
reject the assumption that these responses
are true remote associations,
according to the sense of that concept.
Simply because the subject
makes an intralist error does not 
necessarily mean that it represents an 
association that has been built up 
specifically through serial learning 
any more than if he were asked to utter 
the first number between 1 and 10 that 
occurs to him when the stimulus “go” 
is given. It might be a response made 
prepotent through much prior practice
of it alone. The presumption that 
such errors indicate serially learned, 
direct connections between spatially 
and temporally separated items is 
gratuitous and highly questionable. 
Rather, they can be more simply and 
compellingly explained as being the 
joint result of serial position effects 
and educated guesses on the part of 
the subject in the following manner. 
As he learns, the subject must ac- 
quire the responses themselves (the 
“response-learning” phase of Under- 
wood and Schultz, 1960), as well as 
their relative positions in the list 
(somewhat akin to the “hookup” phase 
of Underwood and Schultz). During 
the early stage of acquisition the sub- 
ject has probably mastered only the 
first few (and possibly the last) items, 
thus reflecting the serial position influence,
and has but a very sketchy
grasp on the identity of the remaining 
items. There is, for him, an uncertain 
region in the central area of the list, 
the order of whose items has as yet 
not been fixed. However, the sub- 
ject does his best to follow instruc- 
tions and occasionally makes a guess 
as to the next item, uttering any one 
he happens to know except those he 
has just seen (therefore, very few 
backward associations), and those 
(probably the first ones) whose 
identity and position have been fixed. 
At this early stage he is more likely 
to emit remote associations of higher 
degree than later, since his ignorance 
of the list is maximal, and his guesses 
are less constrained. As learning pro- 
ceeds, the uncertain region of the list 
shrinks in extent, leaving fewer un- 
placed items to be guessed, thus 
logically insuring that the resulting 
total errors, as well as their degrees 
of remoteness, will be smaller. It can 
be seen in Table 4 that the error total 
for the second fourth of learning is 
higher than for the first fourth, and 
yet, according to the account thus far, 
it should be lower. This may readily 
be understood on the presumption that 
the response-learning phase has
reached completion during the second
fourth of learning, and therefore
the subject has more responses avail- 
able to him for potential guesses than 
he has during the first fourth. 
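
The guessing account above can be illustrated with a small simulation. It is only a sketch under strong simplifying assumptions that go beyond the text: the uncertain region is a run of consecutive list positions, every item in it is already available as a response, and at each uncertain position the subject guesses with equal probability among the uncertain items not yet seen on that trial. The region boundaries, trial counts, and function name are hypothetical.

```python
import random
from statistics import mean


def simulate_guessing_errors(uncertain_positions, n_trials, rng):
    """Return the degree of every intralist error made while guessing.

    At each uncertain position the simulated subject picks one of the
    uncertain items that has not yet appeared on the trial. A wrong pick
    that lies d places ahead of the correct item is an anticipatory error
    of degree d. Backward errors cannot occur under these assumptions.
    """
    degrees = []
    for _ in range(n_trials):
        for index, correct in enumerate(uncertain_positions):
            remaining = uncertain_positions[index:]
            guess = rng.choice(remaining)
            if guess != correct:
                degrees.append(guess - correct)
    return degrees


rng = random.Random(1)
# Hypothetical uncertain regions: wide early in learning, narrow later.
early = simulate_guessing_errors(list(range(4, 12)), 500, rng)
late = simulate_guessing_errors(list(range(7, 11)), 500, rng)

print(len(early) / 500, round(mean(early), 2))
print(len(late) / 500, round(mean(late), 2))
```

As the uncertain region shrinks, both the number of errors per trial and their average degree of remoteness fall, which is the qualitative pattern the rationale predicts. The sketch does not model the response-learning phase, so it does not reproduce the rise in errors from the first to the second fourth of learning.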

The preceding rationale predicts the 
obtained overall pattern of errors in 
Table 4 and in those of other studies 
of this type, without drawing upon 
any dubious assumptions concerning 
the development of any remote associ- 
ations. Such a rationale seems to us 
more parsimonious and somehow more 
promising than its alternative. For 
instance, it would be forced to predict 
that the distribution of the actual 
serial position sources of intralist 
errors would show a tendency, as 
learning proceeded, toward a greater 
concentration beyond the middle of 
the list, because of the progressive 
shrinking of the uncertain region. In 
other words, the variability of these 
errors should show a steady decline 
across successive fourths of learning, 
as acquisition progressed. The remote 
associations doctrine would, of neces- 
sity, be silent on this question, since 
it could deduce nothing specific from 
its premises alone. 

A test of this prediction was made 
from the OL data (up to a criterion 
of one perfect trial) of Experiment 
II. The variances of the distributions 
of intralist errors from the first to 
the last fourths of learning were, re- 
spectively: 7.29, 6.07, 5.20, and 3.91. 
Thus, the predicted decline in vari- 
ability was obtained, and the mean 
location in the list from which the 
errors came was about the eighth item, 
which lies precisely in the region last 
to be mastered. That part of our hy- 
pothesis which asserts that the subject 
guesses at items he has not yet learned, 
but refrains from naming items that 
have already gone by, has received 
empirical confirmation from a recent 
study by Peterson, Brewer, and 
Bertucco (1963). Those authors
tested the possibility

that the occurrence of remote associations 
is strongly influenced by a guessing strategy 
on the part of the Ss. In the ideal case, 
after the S had learned what the responses 
were, if he could remember what items had 
already appeared on a given trial, then he 
could confine his guesses to the remaining 
items [p. 258]. 


Their experimental results supported 
such a hypothesis. 

At this point we are convinced that 
data generated by the method of an- 
ticipatory and perseverative errors 
yield more fully to the above stated 
rationale than to the alternative ac- 
count invoking remote functional con- 
nections. 


DISCUSSION


The major conclusion to be drawn 
is that the classical doctrine of remote 
associations is doubtful, in the sense 
that the data upon which the doctrine 
was dependent do not in fact un- 
equivocally support it, but rather tend 
to support more strongly other ex- 
planatory hypotheses. These other 
hypotheses must be regarded, at the 
very least, as being strong alterna- 
tives to the classical doctrine in the 
light of the experiments described 
above. Nevertheless, what heuristic 
benefits has the classical doctrine been 
able to achieve? The main note- 
worthy use that has been made of 
it has been its application toward an 
explanation of the serial position 
function. Thus, Hull (1935) used 
it when he spoke in terms of trace- 
conditioned responses, and later 
Bugelski (1950), when he made di- 
rect use of the method of anticipatory 
and perseverative errors in develop- 
ing a serial position rationale. We 
must maintain, however, that since 
the premise upon which such explanations
rest (that is, the assumption of
the existence of true remote associations)
has been shown to be open to
question, it follows that such explanations
themselves are not highly likely
to survive future investigation. Judg- 
ing by the evidence already marshalled 
above, it might be more prudent to 
say that the serial position function 
goes a long way toward explaining 
data usually attributed to the opera- 
tion of remote associations, and not 
the other way around. Whatever the 
ultimately accepted explanation of the 
serial position curve will be (and 
papers are still being published about 
that problem), it is unlikely that remote
associations will play any part
in it. 

What more positive statements can 
be made at this time about the conceptualization
of serial memorization?
Certainly, we feel that remote associa- 
tions are out of the question, but it 
would be tempting at least to agree 
that adjacent connections between 
items are formed. However, even 
such an assumption cannot be made 
with confidence, since recent develop- 
ments have rendered it highly ques- 
tionable. Young (1961, 1962) has 
presented a great deal of convincing 
data to the effect that the stimulus 
for an item in a serial list is not the 
preceding item at all, but that it may 
instead be the serial position of that 
item. Also, Ebenholtz (1963) has 
demonstrated clearly that a serial list 
can be learned quite satisfactorily 
without the necessity of forming 
adjacent sequential connections in the 
list, but rather through associating the 
items with their spatial positions. 
With these developments to guide us, 
the most promising (and challenging) 
statement we can offer to describe 
serial learning is that it involves ac- 
quiring of the items per se, and then 
the fixing of their relative positions 
in the list, through associating them,
not with each other, but with a self-generated
sequential or spatial symbol
(such as first, second, etc.). Both 
of these processes probably overlap in 
time, with the response-learning stage 
being initiated first, and the position- 
learning stage starting shortly there- 
after, in much the same manner as 
the two phases discussed by Underwood
and Schultz (1960). It should
be emphasized that this position does 
not deny the existence of associative 
bonds, but it does deny that serial 
learning proceeds by a process that 
results in simple chaining whereby 
one item is the stimulus for the next, 
and it also denies the necessity for 
postulating any remote associations.
Rather, the associations formed are 
more likely to be between the item 
and some distinctive symbol designat- 
ing its relative position in the list, 
and are the result of an active search- 
ing process on the part of the subject. 
An apposite summarization of the 
serial learning situation was made 
many years ago by Woodworth 
(1938) : 


the learn . . . tions, . . . list of number or nonsense syllables, you
. . . to be done was . . . adjacent terms, . . . proceeds largely by
. . . analysis and the finding . . . parts and relations [p. 35].


SUMMARY 


Previous studies relating to the doc- 
trine of remote associations were re- 
viewed, and several new experiments 
testing the validity of the concept were 
presented. 

The overall results of Experiments 
I, II, and III served to support an
hypothesis based upon perception of
derived-list patterning, and were essentially
incompatible with predictions
based on the doctrine of remote as- 
sociations. 

Experiment IV gave support to a 
hypothesis that the association method 
produces its results because of an arti- 
fact, namely, differential practice on 
the correct responses because of the 
serial positions of list items. It was 
concluded that the association method 
was totally inconclusive with regard 
to the question of the reality of re- 
mote associations. 

Findings based upon the use of the 
method of anticipatory and persevera- 
tive errors were consistently accounted 
for by a rationale based on serial po- 
sition effects and the subject’s guess- 
ing pattern, and a further prediction 
based on that rationale was confirmed. 

It was concluded that the doctrine 
of remote associations is doubtful, and 
that other alternative hypotheses merit 
serious consideration. It was suggested
that serial memorization involves the
acquiring of the items per se, and then
the fixing of their positions in the list
through associating them with self-generated
sequential or spatial symbols.


THEORETICAL NOTES


CURARE DRUGS AND TOTAL PARALYSIS 


KENDON SMITH 
University of North Carolina, Greensboro 


Question is raised concerning the efficacy of curare drugs in producing
total striate paralysis. It is pointed out that early Es reported muscular
activity under deep curarization; and that thinking, quite possibly
muscular in nature, is unhampered by curarization. It is further suggested
that the “exophthalmic response,” conditionable under curare,
be more fully elucidated as a nonstriate response. Implications are
drawn for recent experimentation in the area of conditioning under
curare.


Two recent articles by Solomon and 
his co-workers (Black, Carlson, & Solo- 
mon, 1962; Solomon & Turner, 1962) 
have reviewed the history of curare 
drugs in the study of learning. The same 
articles have also provided accounts of 
the authors’ own research in this area. 
The historical reviews and the experimen- 
tal contributions are, I believe, thorough 
and valuable. At the same time, how- 
ever, I should like to describe a reserva- 
tion which might be attached to both 
of them. 

The conclusions reached by the papers 
cited depend completely upon the as- 
sumption that curare and its variants 
produce “total paralysis,” that “there 
can be no skeletal response” in the 
curarized animal. Indeed, this is a view 
coming to be generally held (e.g., Munn, 
1961, p. 491). It is the essence of the 
reservation lodged here, nevertheless, 
that there are good reasons for caution 
in subscribing to such a view. 

The first of these reasons was pointed 
out several years ago (Smith, 1954): 
the pioneer experimenters in the area 
of learning under curare were quite 
explicit about residual muscular re- 
sponses even in deeply curarized animals. 
As Solomon and Turner (1962) them- 
selves remark of the early work of Girden 
and Culler (1937): 


while the subject [a dog] was totally curarized,
the US did elicit a twitch of the semitendinosus
muscle, even though it did not
produce a limb movement. A good semitendinosus
muscle twitch CR developed in response
to presentation of the CS under curare
[p. 205].


Girden and Culler's (1937) own report 
goes into considerable additional detail 
about the exact effects of curare upon 
neuromuscular activity. Thus, they 
point out that, 


bursts of twitching or “shivering” would 
ripple over the body; this exemplifies the 
strychnine-like effect of the drug on the spinal 
cord, and is not a symptom of inadequate 
curarization [p. 266]. 


Weak stimulation of the motor roots 
of the spinal cord produced in curarized 
muscle a response which “was definitely 
stronger [p. 271]” than normal. As
Girden and Culler finally point out,


The curare effect seems to consist more in 
depressing the reaction to strong stimuli than 
in raising the limen for weak stimuli. Curare 
exhibits a kind of “hobble-effect”: the muscle 
can react “thus far and no farther,” whatever 
magnitude of stimulus is employed; but its 
response to liminal or near-liminal stimuli 
is but little affected . . . Indeed the strych- 
nine-effect seems to enhance the stimulating 
value of a near-liminal shock on the motor- 
root [pp. 271-272]. 


Girden (1942) later worked extensively 
with conditioned responses in the striate 
musculature of dogs more or less deeply 
under the effects of either curare or 
erythroidine (one of the first of the 
synthetic curares). In fairness, it must 
be said that, although Girden reported 
successful conditioning even in thor- 
oughly drugged animals, he did remark 
that, “When the animal is in the very 
deep state, no overt response other than 
the direct contraction of muscle to shock 
. . . occurs. At this point, of course, 
striate CR cannot be evoked at all 
[pp. 116-117]”; but it must be further 
pointed out that Girden was working 
now with the gross behavior of intact 
animals rather than with the closely 
observed responses of single muscles. 

Strictly speaking, of course, these 
findings reflect only upon curare itself 
and upon one of its early variants. A 
second reason for caution lies, however, 
in the suggestion by later, independent 
evidence that even the modern versions of 
the drug have merely a “hobble-effect” 
upon the striate musculature. The evi- 
dence in point comes from the rather dra- 
matic demonstration carried out by Smith, 
Brown, Toman, and Goodman (1947). 
As is now well known, Smith allowed 
himself, in this demonstration, to be 
placed deeply under the effects of 
d-tubocurarine; and appropriate tests revealed 
“no changes . . . in the electroencephalogram, 
consciousness and sensorium, 
or in any aspect of higher central 
nervous system function [p. 13].” One 
way of interpreting these findings, and 
the way to which Solomon and his co- 
workers (Solomon & Turner, 1962) 
would undoubtedly be partial, would 
be this: 


Inasmuch as thought processes are seen to 
go on unhampered in a totally paralyzed 
individual, thinking cannot depend upon the 
function of the striate musculature; and a 
motor theory of thinking is thus rendered 
untenable. 


1 The present writer had occasion, at about 
this time, to use curare in suppressing the 
activity of the intra-aural muscles of the cat 
(Smith, 1943). The drug was helpful; but, 
even in the deeply curarized animal, totally 
unresponsive by the usual tests and being 
maintained by artificial respiration, residual 
activity in the tensor tympani and stapedius 
muscles was clearly evident in the variability 
of the electrical response of the cochlea. 


Another, and, it seems to me, at least 
equally defensible way of interpreting 
the same findings would be this: 


Inasmuch as thought processes are seen to 
go on unhampered in a deeply curarized 
individual, and inasmuch as there exists a 
fair body of evidence favoring a motor theory 
of thinking, it would appear that even radical 
curarization does not totally abolish response 
in striate muscles. 


It will be seen that the latter view fits 
in nicely with the evidence from the 
earlier animal studies. 

A final question is raised by the work 
of Black, Carlson, and Solomon (1962) 
themselves. Using both d-tubocurarine 
and another variant of curare, “Flaxedil,” 
and working with dogs as subjects, they 
were able to condition what they have 
called the “exophthalmic response,” 
an outward bulging of the eyeball. 
Since this bulging is undoubtedly a 
muscular effect, and since the only 
generally known muscles which might 
cause the effect are the striate extrinsic 
muscles of the eye, the immediate suggestion, 
again, is that at least some 
responsivity remains in curarized striate 
muscles. Solomon has indicated in personal 
communication² that a species of 
smooth muscle lying behind the eyeball is 
responsible for the exophthalmic effect, 
and such a muscle does indeed seem to 
exist (Kronfield, 1943, p. 35; White, 
Smithwick, & Simeone, 1952, p. 252). 
The substantive issue is thus resolved; 
but the exophthalmic response would appear 
to deserve a more complete public 
explanation than it has received to date. 

The qualification which these considerations 
suggest is plain. It may be 
much more likely than first appears that 
the somatic nervous system and the 
skeletal musculature are importantly 
involved in conditioning under curare 
and under other drugs related to it. 
We are not yet forced to share the view 
expressed by Solomon and Turner when 
they remark: 


we can definitely maintain that differential 
peripheral skeletal responses (and their associated 
feedback) could not have been a factor in 
mediating the discrimination learning which 
we have clearly demonstrated. Thus, those 
theories of transfer of training and of thinking 
which restrict themselves to peripheral skeletal 
response mediators, and their correlated feedback 
from the periphery, cannot explain the outcome 
of our experiment [p. 217; italics in original]. 


2 R. L. Solomon, personal communication, 
March 18, 1962. 


CARDIAC CONDITIONING AND SKELETAL 
RESPONDING IN CURARIZED DOGS¹ 


A. H. BLACK AND W. M. LANG 
McMaster University, Ontario 


Smith (1963) has presented “reservations” 
concerning the appropriate theoretical 
interpretation of some recent 
data on conditioning under curare (Black, 
Carlson, and Solomon, 1962; Solomon 
and Turner, 1962). These data show, 
first of all, that Pavlovian conditioning 
of cardiac, pupillary, and exophthalmic 
responses occurs in curarized subjects, 
and secondly, that such conditioning 
affects subsequent behavior (both auto- 
nomic and skeletal) in the normal state. 
These data were interpreted as indicating 
that conditioning in curarized subjects 
“provides evidence to support the posi- 
tion that autonomic conditioning does 
occur independently of skeletal activity” 
(Black, Carlson, & Solomon, 1962, p. 
25). It was suggested further that, 


Since Pavlovian conditioning procedures un- 
der curare seem to affect subsequent skeletal 
behavior in the normal state, the manner in 
which this effect is mediated becomes im- 
portant for all S-R reinforcement theories. 

We are forced to consider how “infor- 
mation” acquired under curare, when there 
can be no skeletal response, “carries over” 
or transfers so as to influence skeletal re- 
sponses in the normal state. Three possi- 
bilities immediately suggest themselves: One 
is some form of “central” mediation or “pure 
cognition.” Another involves peripheral 
“mediation” via differential autonomic re- 
sponding and its correlated, discriminated, 
feedback stimuli. And finally, a combination 
of these two possibilities might be postulated 
(1962, pp. 28-29). 


1 This work was supported by Research 
Grant MY-2741 from the National Institute 
of Mental Health, United States Public 
Health Service to A. H. Black. 

2 This is not to say that skeletal activity, 
when it occurs, cannot affect the form of 
the cardiac response. There is considerable 
evidence to show that it can. Rather, the 
point is that skeletal activity is not necessary 
for the occurrence of the cardiac response. 


Smith (1954, 1963) argues first, that 
interpretations such as these are tenable 
only if absolutely no skeletal responding 
at all occurs under curare-like drugs, and 
secondly, that the curare experiments 
do not preclude the possibility of skeletal 
activity which could mediate the conditioned 
response. 

We do not agree with these contentions. 
Rather, we would argue as follows: 
(a) It is only when the definition 
of a “skeletal response” is left vague that 
Smith’s criticisms seem valid. If 
the concept skeletal response is operationally 
defined, then the curare experiments 
provide evidence to support the 
hypothesis that autonomic conditioning 
and transfer between curarized and 
normal states do occur without skeletal 
responding. (b) Even if one assumes 
that some fragmentary skeletal responding 
takes place in curarized subjects, 
one must make assumptions which seem 
highly unlikely in order to maintain a 
motor-mediation theory of conditioning 
and transfer. These arguments are 
amplified in the following discussion. 

1. According to Smith, the hypothesis 
which we must verify is that “skeletal 
behavior does not occur in curarized 
animals.” His main point is that if we 
fail to observe skeletal behavior in 
curarized subjects, we ought not to 
“accept” the above hypothesis. Rather, 
we ought to state simply that we have 
“failed to reject it” because there may 
be techniques for determining the presence 
of skeletal behavior that are more 
sensitive than the ones which we 
employed. 

One can interpret this criticism in two 
ways, either as a logical or as an empirical 
criticism. First, it could be a 
statement of the impossibility of the 
absolute verification of any hypothesis. 
If our experiments fail to show skeletal 
behavior in curarized subjects we cannot 
conclude that we have proved our hy- 
pothesis. Interpreted in this manner, 
the criticism is concerned with the logic 
of inductive inference, and simply points 
out that we can never have absolute 
certainty concerning any hypothesis. 
In this sense, Smith’s criticism is correct. 

Secondly, even though experimental 
results cannot prove our hypothesis 
absolutely, it seems reasonable to think 
that they could provide some supportive 
evidence for it. If this position is ac- 
cepted, Smith’s criticism seems to imply 
that the evidence already presented is 
not sufficient to support the hypothesis 
that skeletal behavior does not occur 
in curarized subjects. As Smith (1954) 
has stated in an earlier paper, “Unless 
an experimenter is meticulous in the 
elimination of skeletal responses, it will 
always be possible to account for visceral 
conditioning by postulating undetected 
somatic activity [p. 223].” In this 
second interpretation of Smith’s criti- 
cism, the basic problem seems to be in 
determining empirically how to eliminate 
skeletal responses “meticulously.” This 
can be accomplished only by first opera- 
tionally defining a skeletal response. 
Thus, if we could reach agreement as to 
what operationally constitutes a skeletal 
response, then presumably, there would 
be no problem in setting up an experi- 
ment which would allow one either to 
reject or accept the hypothesis that 
skeletal behavior does not occur in 
curarized subjects. 

In our previous experiments we de- 
fined a skeletal response as an overt 
response; that is, one which was observ- 
able by the experimenter in the intact 
organism. Conditioning of autonomic 
responses and transfer occurred when 
skeletal responses, so defined, were pre- 
vented. A more stringent operational 
definition of a skeletal response is pro- 
vided by the electrical activity recorded 
from the muscle. We have made obser- 
vations using this approach as well, 
recording muscle potentials (EMG) and 
cardiac responses concomitantly in curarized 
dogs. A brief description of these experiments 
follows. Muscle potentials in 
experimentally naive dogs were recorded 
from subcutaneous needle electrodes 
feeding into EEG preamplifiers on a 
Grass Polygraph. Four locations were 
used: (a) over the biceps femoris on the 
lateral surface of the left hind limb 
(L-BF), (b) over the biceps femoris on 
the lateral surface of the right hind limb 
(R-BF), (c) over the sartorius on the 
dorsolateral surface of the right hind 
limb (R-S), and (d) on the forehead (F). 

The dogs were conditioned in a sound- 
proof chamber. Stimulating and record- 
ing equipment were kept in an adjoining 
room. The CS was a 4,000-cycle tone; 
the UCS a 4-milliampere electric shock. 
The CS-UCS interval was 10 seconds, 
and the shock duration 5 seconds. A 
delayed conditioning procedure was em- 
ployed. Curarization was carried out by 
injecting d-tubocurarine chloride through 
a polyethylene catheter which had been 
inserted into the recurrent tarsal vein. 
A Harvard infusion pump was employed 
in order to control the rate of injection. 
D-tubocurarine chloride was injected 
slowly until respiratory failure occurred. 
An endotracheal tube was then inserted 
and artificial respiration begun, using a 
Harvard Respiration Pump. Drug in- 
jections were continued throughout the 
course of the experiment. 

In the typical polygraph records which 
follow, the top two channels present 
muscle potential, the third EKG, and 
the fourth presents a tachographic re- 
cording of the heart rate. Also, a signal 
marker is superimposed on the fourth 
channel. The tachograph presents each 
interbeat interval as a vertical distance. 
The shorter the interval between beats 
(and the faster the heart rate), the less 
is the vertical displacement of the pen. 
The onset of the CS is represented by an 
arrow above the signal marker line, and 
the occurrence of UCS is represented by a 
solid bar on the signal marker line. In 
order to avoid 60-cycle noise, the muscle 
potential channels were switched off just 
before shock presentations. 
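
The tachograph convention described above can be illustrated with a minimal sketch. It assumes only that the pen displacement is proportional to the interbeat interval; the function names and the scale factor `units_per_second` are hypothetical and are not taken from the apparatus described in the text.

```python
def heart_rate_bpm(interbeat_interval_s):
    """Convert one interbeat interval (in seconds) to a rate in beats per minute."""
    if interbeat_interval_s <= 0:
        raise ValueError("interbeat interval must be positive")
    return 60.0 / interbeat_interval_s


def pen_displacement(interbeat_interval_s, units_per_second=1.0):
    """Vertical pen displacement, assumed proportional to the interbeat interval.

    units_per_second is an illustrative scale factor, not a calibration
    reported in the text.
    """
    return units_per_second * interbeat_interval_s


# A shorter interval means a faster heart rate and a smaller displacement.
fast = 0.5   # seconds between beats
slow = 1.0
print(heart_rate_bpm(fast), pen_displacement(fast))
print(heart_rate_bpm(slow), pen_displacement(slow))
```

Under this assumption, a fall in the tracing corresponds to cardiac acceleration and a rise to deceleration.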

In some of the following records, slight 


3 D-tubocurarine chloride was kindly sup- 
plied by E. R. Squibb and Sons of Canada, 
Limited. 
Fig. 1. Heart and muscle potentials following 
cessation of cardiac activity. (A—Dog 
N-1. This dog died during the course of 
the final experimental session. B—Dog S-3. 
This dog was sacrificed, using Nembutal, following 
completion of the experiment.) 


artifactual perturbations in the EMG 
tracings were observed. These were of 
three types. First of all, there were arti- 
facts which were associated with the 
operation of the equipment controlling 
stimulus presentations. Secondly, rhyth- 
mic oscillations occurred which seemed 
to be associated with the heart beat. 
These can be attributed to the movement 
of the dog and electrodes, produced by 
the force of the heart beat. This ballisto- 
cardiac effect was especially noticeable in 
curarized dogs. It could be accentuated 
or decreased by changing the position of 
the dog with respect to the electrodes and 
lead-in wires. Finally, there were small 
Fig. 2. Heart rate and muscle potentials 
in the normal state for dog S-4. (See text for 
explanation of trace identifications. Calibrations: 
time, 2 seconds; EMG amplitude, 100 
microvolts.) 


artifactual changes in the EMG which 
seemed to be produced by the recording 
system itself. These were observed as in 
Figure 1 where records are shown for 
two dogs following cessation of the heart 
beat. Presumably, changes in the EMG 
at this time would not be attributed to 
skeletal movements. None of these 
types of perturbation in the EMG 
records were scored as skeletal activity. 

Figures 2 and 3 show data for a typical 
dog during an experiment on simple 
Pavlovian conditioning. In Figure 2 
the records show EMG and heart rate 
activity in the normal state before 
curarization. Figure 3 shows data for 
the same dog following curarization. 
Record A of Figure 3 shows the first 
Fig. 3. Heart rate and muscle potentials 
during Pavlovian conditioning in the curarized 
state for dog S-4. (A—Acquisition Trial 1. 
This trial followed the administration of 36 
milligrams of d-tubocurarine chloride over 90 
minutes. B—Acquisition Trial 16. This trial 
followed the administration of 102 milligrams 
of d-tubocurarine chloride over 210 minutes. 
Calibrations: time, 2 seconds; EMG ampli- 
tude, 100 microvolts.) 


acquisition trial, and Record B, the 
sixteenth acquisition trial. On the six- 
teenth trial there was a clear-cut heart 
rate response (the difference between 
the rate during the 10-second CS-UCS 
interval and the 10 seconds preceding 
CS onset was 33 beats per minute) but 
no apparent change in EMG. For 
this dog, cardiac conditioned responses 
occurred with no detectable muscular 
movement. 
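
The heart rate response reported above is a difference between two 10-second rates. A minimal sketch of that arithmetic follows, assuming the rate in each window is obtained by counting beats; the text does not say how the rates were actually read from the records, and the function names and the simulated beat times are hypothetical.

```python
def rate_in_window(beat_times_s, start_s, end_s):
    """Mean heart rate (beats per minute) from beats falling in [start_s, end_s)."""
    n_beats = sum(1 for t in beat_times_s if start_s <= t < end_s)
    return 60.0 * n_beats / (end_s - start_s)


def cardiac_response(beat_times_s, cs_onset_s, window_s=10.0):
    """Rate during the CS-UCS interval minus rate in the equal period before CS onset."""
    pre = rate_in_window(beat_times_s, cs_onset_s - window_s, cs_onset_s)
    during = rate_in_window(beat_times_s, cs_onset_s, cs_onset_s + window_s)
    return during - pre


# Simulated record (illustrative only): 10 beats in the 10 seconds before
# the CS, then 15 beats in the 10 seconds after CS onset at t = 10.
pre_cs_beats = [float(k) for k in range(10)]
cs_beats = [10.0 + k * (10.0 / 15.0) for k in range(15)]
print(cardiac_response(pre_cs_beats + cs_beats, cs_onset_s=10.0))
```

A positive value indicates acceleration during the CS and a negative value deceleration; the sign convention here is a choice made for the sketch.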

In a second procedure, one dog was 
trained in the normal state to avoid 
shock by flexing its right hind leg. This 
procedure was designed to increase the 
probability of skeletal responses to the 
CS in subsequent conditioning under 
curare. Data for this dog are presented 
in Figures 4 and 5. Figure 4 shows the 
first (Record A), and fifth (Record B) 
trials on the tenth day of avoidance 
training in the normal state. When the 
first trial was conducted (A) there was 
a considerable number of intertrial 
responses, but by the fifth trial (B) these 
had disappeared, and the only large 
responses that occurred were in the 
presence of the CS. Following the fifth 
trial the dog was curarized. Curare was 
administered slowly in order to observe 
the gradual decline in skeletal responding 
to the CS. If any EMG activity was 
noted in the presence of the CS, the CS 
was terminated and shock avoided for 
that trial. If no muscle activity was 
noted, the CS and UCS were both pre- 
sented. Record A in Figure 5 shows 
data for the second trial under curare 
when some EMG activity was noted. 
In Record B there seems to be no ap- 
parent movement to the CS. There is, 
however, a very small (approximately 2 
Fig. 4. Heart rate and muscle potentials 
during avoidance conditioning in the normal 
state for dog N-1. (A—Acquisition Trial 1 
on the tenth day of avoidance conditioning. 
B—Acquisition Trial 5 on the tenth day of 
avoidance training. EMG amplitude was re- 
duced in these records because of the con- 
siderable activity displayed by the dog. Cali- 
brations: time, 2 seconds; EMG amplitude, 
100 microvolts.) 


Fig. 5. Heart rate and muscle potentials 
during conditioning in the curarized state 
for dog N-1. (A—Trial 2. On this trial 
some EMG activity occurred and shock was 
omitted. This trial followed the administra- 
tion of 18 milligrams of d-tubocurarine chlor- 
ide over 33 minutes. B—Trial 29. On this 
trial the CS and UCS were paired. This trial 
followed the administration of 120 milligrams 
of d-tubocurarine chloride over 200 minutes. 
Calibrations: time, 2 seconds; EMG ampli- 
tude, 100 microvolts.) 


beats per minute) but stable⁴ cardiac 
response. Following curarization, the 
dog was again tested in the normal state 
and the avoidance response occurred in 
all 10 test trials. 

These data seem to support further the 
hypothesis that cardiac conditioned re- 
sponses can occur in curarized dogs when 
skeletal activity, as defined by change 
in EMG level, is prevented. 

These results, of course, do not meet 
Smith’s logical criticism. There is always 
the possibility that some undetected 
form of skeletal muscle activity could 
have occurred in this experiment. How- 
ever, there is one point that should be 
made here; that is, the same logical argu- 
ments apply to the hypotheses involved 
in a motor-mediation theory of condition- 
ing. For example, even if evidence were 
provided to show that skeletal responses 
were found whenever a cardiac condi- 
tioned response took place, one could 
argue that some situation could be 
discovered in which cardiac conditioned 
responses would occur without skeletal 


4 The difference between pre- and post-CS 
heart rates averaged between 2 and 4 beats 
per minute on this series of trials. 
Fig. 6. Heart rate and muscle potentials 
during Pavlovian Conditioning in dog S-3. 
(A—A conditioning trial under complete 
curarization. This trial followed the admin- 
istration of 42 milligrams of d-tubocurarine 
chloride over 140 minutes. B—A condition- 
ing trial after the administration of Tensilon 
which produced partial recovery of skeletal 
activity. This trial occurred approximately 3 
hours following the trial shown in Record A. 
Calibrations: time, 2 seconds; EMG ampli- 
tude, 100 microvolts.) 


activity. Thus the logical argument 
provides no basis for choice between the 
two positions. Rather, such a choice 
would have to be made on the basis of 
the available evidence, incomplete as it 
may be, and the present evidence does 
support the hypothesis that autonomic 
conditioned responses can occur in 
curarized dogs without being mediated 
by skeletal activity. 

2. Even if one does assume that some 
undetected form of skeletal behavior 
occurs in our curarized preparation, the 
results of our previous experiments, in 
conjunction with the data described 
above, still have relevance to motor 
theories of learning and transfer. Smith 
(1954, 1963), in his outline of a motor 
theory of thinking, makes three main 
points. First, cognitive activity is 
dependent on skeletal behavior. Under 
the heading of cognitive activity would 
be included both the process of condi- 
tioning and transfer from the curarized 
to the normal state. His second point 
is that skeletal feedback is crucial in the 
Pavlovian conditioning of autonomic 
responses; i.e., the only way in which 
autonomic Pavlovian conditioning can 
be explained is by assuming that auto- 
nomic responses are simply artifacts of 
the conditioned skeletal responses, or, 
when the skeletal response is conditioned, 
then the autonomic response occurs 
because of innate neural connections 
relating feedback from the skeletal 
responses to the autonomic nervous 
system. His third point is that the 
cognitive system will be driven ade- 
quately even if skeletal responding is 
reduced, as long as the amount of skeletal 
responding is greater than zero. This 
implies that most skeletal behavior is 
redundant, and that we only need mini- 
mal skeletal activity in order to instigate 
full cognitive activity. 

It is at this point that the heuristic 
deficiency in Smith’s system clearly 
shows itself. If autonomic conditioning 
and the transfer between curarized and 
normal states are artifacts of skeletal 
activity, one would expect to observe 
some relationship between skeletal be- 
havior and these phenomena. Thus, 
in order for a motor-mediation theory 
to be useful we would argue that it is 
not enough to demonstrate that some 
skeletal activity occurs in curarized 
subjects. One also must show a rela- 
tionship between specific, designated 
patterns of motor behavior and autonomic 
conditioning and transfer. 

Data which bear on this point are 
shown in Figure 6. Record A shows a 
conditioned cardiac response in a typical 
curarized dog in a simple Pavlovian 
conditioning experiment. There is an 
obvious heart rate response to the CS 
but no apparent EMG response. Three 
hours after completion of the condition- 
ing phase of the experiment, Tensilon 
was administered in order to counteract 
the effects of curare, and further conditioning 
trials were run. One such trial 
is shown in Record B. The amount of 
Tensilon injected permitted partial re- 
covery of skeletal responding. At this 
stage of recovery, the skeletal activity 
consisted mainly of bursts of trembling 
and shivering in phase with the respira- 
tory cycle. This behavior pattern is com- 
mon among dogs recovering from curare 
paralysis when body temperature drops. 

If there were some relationship be- 
tween skeletal activity and the cardiac 
response we would expect a change in the 
pattern of skeletal activity at the same 
time, or slightly preceding, the onset 
of the cardiac response. Furthermore, 
if skeletal activity were mediating transfer 
from the curarized to normal state, 
we would expect similar patterns of 
skeletal activity in the two states when 
cardiac responses were similar in both 
states. In the top channel of Record 
B, skeletal activity is higher during the 
CS period (especially between bursts of 
trembling) than during the pre-CS 
period. This is particularly noticeable 
during the last few seconds of the CS- 
UCS interval. Such correlations be- 
tween cardiac and skeletal responding 
are common in the fully normal state 
also. However, there is no such rela- 
tionship between the skeletal activity 
and cardiac responses in the curarized 
state (Record A). In addition, the 
heart rate response cannot be completely 
accounted for by correlated changes in 
muscular activity since there was a 
cardiac response in Record A very similar 
to that shown in Record B, but no change 
in muscular activity in A similar to that 
in B. Thus, these data provide no clues 
as to what EMG changes could correlate 
with cardiac responses under curare, 
and as to what properties of the skeletal 
response would act as mediators of the 
cardiac conditioned response between 
the curarized and normal states.⁵ 


In summary, then, our available 
evidence suggests that skeletal activity 
in curarized subjects does not occur 
closely correlated in time with cardiac 
conditioned responses. Furthermore, 
while the curare experiments do not rule 
out the possibility of any conceivable 
type of skeletal activity, they do force 
a motor-mediation theory of conditioning 
to become specific enough to be testable. 
What is a useful operational definition 
of skeletal activity that might occur 
under curare? What are the properties 
of skeletal behavior which might be 
correlated with autonomic conditioned 
responses under curare? What are the 
properties of skeletal behavior that might 
mediate cardiac transfer between the 
curarized and normal states? Until 
these questions are answered, and until 
some evidence is provided to support a 
motor theory, it seems sensible to conclude 
that autonomic responses can occur 
without skeletal responding. 


5 While these data suggest that skeletal 
activity concomitant with the cardiac response 
is not necessary for the occurrence of that 
cardiac response, they do not preclude the 
possibility that skeletal responses may be related 
to cardiac conditioning in some more 
complex manner. For example, one might 
argue that skeletal activity during the action 
of the UCS on at least one or two trials is 
necessary before conditioning will occur. 
This hypothesis could not be tested since no 
EMG records were taken during the action of 
the UCS. It is not our intention, however, 
to explore problems such as this in the present 
paper. 


COMMENT ON THE PAPER BY BLACK AND LANG 


KENDON SMITH 
University of North Carolina, Greensboro 


Brief comment is made on the points and data presented by Black and 
Lang (1963). 


My original note (Smith, 1963), to 
which Black and Lang (1963) have now 
replied, concerned itself solely with the 
question of whether or not curare drugs 
do in fact forestall response in skeletal 
muscles. Even within this limited context, 
it is possible to comment on one 
or two of the points raised by Black 
and Lang. 

For example, I have no intention of 
defending the “logical” criticism of 
curare experiments—the argument that 
negative evidence, in any quantity, is 
always inconclusive. We all know that . . . 
appear that Black and Lang had 
begun to develop such . . . 
If their electromyographic . . . 
demonstrate a consistent blockage 
of skeletal response by curare drugs, 
their point clearly will have been made. 

Black and Lang are concerned over 
the very definition of “skeletal response.” 
Their concern has puzzled me, because 
I have used the term simply to mean 
“any activity of the skeletal muscles.” 
No doubt the term could be given one 
“operational definition” or another. It 
seems much more to the point, however, 
to accept it at face value and then devise 
the operations required for detecting 
skeletal responses so defined. It would 
appear that the technique of electromyography 
might be one such operation. 

The unaided observation of response 
in the intact organism . . . 
is not. 


PSYCHOLOGICAL REVIEW 


FURTHER WORK ON LANGUAGE DEFICIT IN APHASIA* 


HILDRED SCHUELL 


University of Minnesota Medical School 


The writers earlier proposed that language deficit in aphasia could be 
viewed as a general dimension, and showed that homogeneous scales 
could be developed with tests selected from an aphasia battery. 
These scales were re-examined on a population of 157 aphasic patients 
and again found to meet the usual requirements for homogeneity. 
The 18 tests used on the scales were examined in the light of a factor 
analysis of 69 tests in the battery. They were found to load heavily 
on the 1st principal axis and on the 1st rotated factor, identified 
as general language. Results were consistent with the hypothesis of 
a dominant dimension of general language deficit in aphasia. 


A few years ago the writers presented 
an analysis of aphasic test data 
in support of the hypothesis that there 
is a dominant dimension of general 
language deficit in aphasia (Schuell 
& Jenkins, 1959). Twenty-nine tests 
from the Minnesota Test for Differential 
Diagnosis of Aphasia (Schuell, 
1955), judged to be primarily tests of 
language performance, were studied to 
see if a Guttman scale might be developed. 
Eighteen tests survived an 
initial screening by phi coefficients, 
and proved to constitute a homogeneous 
scale when scored on a pass-fail 
basis. The scale showed high reproducibility 
for the total sample of 
100 aphasic patients and for diagnostic 
subgroups treated separately. 

Two short scales were constructed, 
the first consisting of an abbreviated 
scale of nine tests selected from the 
18 for high phi coefficients and wide 
range of difficulty; the second an 
a priori clinical scale of seven tests 
considered to represent an orderly pro- 
gression in recovery from aphasia. 
For all patient groups and scale as- 
semblies, the data yielded coefficients 
of reproducibility that were markedly 
higher than the minimum reproduci- 
bility indices. Cross validation on a 
small group of patients (N = 23) gave 
further confirmation of the results. 
The study was criticized by Jones 
and Wepman (1961) in connection 
with their findings on a factor analysis 
of aphasic test data. Their argument 
was that restricted selection of tests, 
the wide range of language impair- 
ment represented in the patients, and 
the method of analysis employed 
forced the appearance of a Guttman 
scale where separate language factors 
could have been found. Schuell and 
Jenkins (1961) attempted to clarify 
their position and sought a rapproche- 
ment with the findings of Jones and 
Wepman’s analysis, pointing out that 
differences in the tests employed, patient 
samples, and analysis procedures 
worked to minimize the likelihood that 
Jones and Wepman would find a 
general dimension of deficit even if 
it were present. At the conclusion 
of the reply the writers attempted to 
make their position clear with the 
following statement: 


We realistically expect that, over a large 
sample of aphasic patients, many . . . 
of impairment resulting from brain damage 
are identifiable, and need to be studied, in 
addition to the common or general dimension 
of language deficit. We further expect 
that, at a given level of language 
deficit, language . . . 
deficit present . . . (Schuell 
& Jenkins, 1961, p. . . .). 

What we are saying, in short, is that 
all aphasic patients show a reduction 
of language that cuts across language 
modalities. In this respect the aphasic 
can be compared to an individual who 
knows a limited number of words, ex- 
pressions, and permissible sequences in 
any given language. He will find 
himself at a loss whether he tries to 
listen, to converse, to read, or to write. 
In addition to this general loss, the 
aphasic may have additional perceptual 
or sensorimotor disabilities that affect 
some kinds of language behavior more 
than others. Luria (1958) has pointed 
out that it is these special kinds of 
impairment that make differential 
diagnosis possible. We are not saying 
that there is only one clinical aphasic 
syndrome. We are saying instead that 
the distinctive feature of aphasia is a 
reduction of language. Bay² makes 
the same point when he insists upon 
the necessity for differentiating between 
aphasia and dysarthria. Our 
data are here treated to make this 
point explicit, so that it will not be 
neglected in the discussion of observed 
patterns of aphasic impairment and 
in the discussion of other factors appearing 
in factor analyses of aphasic 
performance. 

² Bay, E., Symposium on Language Disorders, 
CIBA Foundation, London, 1963. 

Data have been collected for 157 
patients on Form 6 of the Minnesota 
test, and a factor analysis of 69 tests 
in the battery has been performed 
(Schuell, Jenkins, & Carroll, 1962). 
These new data permit a re-examina- 
tion of the Guttman scales and at the 
same time make possible a study of 
the factorial composition of the tests 
comprising the scales. 


METHOD 
Subjects 
Subjects were 157 aphasic patients. Of 
these, 155 were patients hospitalized on the 


Neurology Service of the Minneapolis Vet- 
erans Administration Hospital between 
June 1955 and June 1958. These subjects 
were consecutive admissions to the Aphasic 
Section of the Neurology Service, except 
for the following exclusions: patients who 
were neurologically unstable; patients with 
a psychiatrically confirmed diagnosis of 
psychosis or regression; patients who re- 
sponded to no test items; and patients who 
were evaluated for diagnostic purposes, but 
were not considered aphasic. In order to 
increase the number of patients in one 
diagnostic group, records of two aphasic 
patients tested by Schuell at Syracuse Uni- 
versity, and made available through the cour- 
tesy of Louis di Carlo, were included. 
Patients were distributed as follows in 
the diagnostic-prognostic groups: 3 

Group I (N = 21), Simple aphasia without 
perceptual or motor involvement. Auditory 
retention span and auditory recall were 
impaired with impairment reflected in defective 
speech, reading, and writing. Speech, 
reading, and writing improved simultaneously 
when intensive auditory stimulation 
was given. These patients made excellent 
recovery. 

3 These are the same clinical groups as 
those specified in more detail in the earlier 
article. The numbering of the groups has 
been changed as part of the effort to make 
the classification system more logical and 
easier to remember. The prognosis for recovery 
is roughly parallel to the order of 
the group numbers. 

Group II (N = 27), Aphasia with cerebral 
involvement of visual processes. Patients 
were like those in Group I but showing 
additional impairment of visual discrimination, 
recognition, and recall. They confused 
letters and words with similar visual 
configurations, and tended to spell phonetically. 
Most Group II patients had visual 
field defects. Patients recovered speech well 
but reading and writing improved much 
more slowly. 


Group III (N = 18), Severe aphasia 
with sensorimotor involvement. Patients 
usually showed severe impairment of auditory 
retention span and recall. They 
made consistent articulation errors and 
speech tended to be labored and hesitant. 
Speech was functional when acquired but 
remained limited and defective over long 
periods. 


Group IV (N = 65), Aphasia with some 
language and involvement of both [...] 
and motor processes. Severity of 
involvement of each process varied from 
patient to patient, producing superficially 
different clinical pictures. There was usually 
some weakness of the speech musculature. 
Although a good deal of language 
was often retained in one modality or another, 
patients tended not to be capable of 
[...] or self-directed effort. Functional 
speech could be increased or made 
more intelligible in many cases but only 
limited goals were achieved. Neurological 
findings were compatible with scattered or 
generalized brain damage. 

Group V (N = 23), Irreversible aphasia 
with almost complete loss of functional 
language skills. Patients in this group had 
no functional speech, reading, or writing. 
They made errors pointing to common objects 
named by the examiner. None of 
them acquired functional speech, although 
they usually learned to repeat and copy, and 
some reactive speech appeared. 
Three patients did not appear to be adequately 
characterized by the above classification. 
They showed partial auditory im[...] 
with some [...], 
although testing revealed rather severe 
language deficits in all modalities. 


Tests 


The 18 tests which constituted the previously 
developed scale are listed below by 
descriptive titles. More detailed descrip- 
tions may be found in Schuell and Jenkins 
(1959). The tests were given individually 
as part of the diagnostic testing program 
on admission to the aphasia clinic. 


1. Point to items named 

2. Point to letters named 

3. Follow spoken directions 

4. Listen to sentences, indicate right or 
wrong 

5. Listen to long paragraph, indicate yes or 
no to question on content 

6. Match words with pictures 

7. Repeat monosyllabic words 


8 


11. Name objects, pictures 

12. Give rhyming words 

13. Give biographical information in re- 
sponse to questions 

14. Tell three things done during day, 
and three things a good citizen should do 
15. Describe pictured situation 

16. State similarity between pairs 

17. Write letters dictated in random order 
18. Write dictated sentences 

For purposes of scaling, each test was 
treated as a single pass-fail item, and 
scored as failed if the patient made an 
error. For the factor analysis the test 
scores were dichotomized as close to 50% 
difficulty as possible to permit calculation 
of tetrachoric correlations. 
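
The dichotomization step described above can be sketched as follows. This is only an illustration, not the authors' procedure: the function names are hypothetical, and the tetrachoric coefficient is estimated with the classical cosine-pi approximation rather than the exact bivariate-normal solution, which requires iterative computation.

```python
import math

def dichotomize(scores):
    """Choose the pass/fail cut that puts the pass rate closest to 50%.

    Returns the cut score and a 0/1 list (1 = score at or above the cut).
    """
    n = len(scores)
    cut = min(sorted(set(scores)),
              key=lambda c: abs(sum(s >= c for s in scores) / n - 0.5))
    return cut, [int(s >= cut) for s in scores]

def tetrachoric_cos_pi(x, y):
    """Cosine-pi approximation to the tetrachoric correlation of two 0/1 lists."""
    a = sum(1 for u, v in zip(x, y) if u == 1 and v == 1)  # pass both tests
    b = sum(1 for u, v in zip(x, y) if u == 1 and v == 0)  # pass first only
    c = sum(1 for u, v in zip(x, y) if u == 0 and v == 1)  # pass second only
    d = sum(1 for u, v in zip(x, y) if u == 0 and v == 0)  # fail both tests
    if b * c == 0 and a * d == 0:
        return float("nan")  # table too degenerate to estimate anything
    if b * c == 0:
        return 1.0           # no discordant pairs
    if a * d == 0:
        return -1.0          # no concordant pairs
    return math.cos(math.pi / (1.0 + math.sqrt((a * d) / (b * c))))
```

The approximation is most trustworthy when both tests are split near 50%, which is one reason to place the cut where the text says it was placed.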


RESULTS 
Plan of Analysis 


It was decided to re-examine the 
largest and the smallest of the scales 
developed previously, the 18-test scale 
and the 7-test scale. Each of these 
scales was tested against the full sample 
of aphasic patients and against patients 
in the three diagnostic subgroups 
separately studied in the earlier paper. 
It should be noted that this re-exami- 
nation of the scale does not consti- 
tute a bona fide cross validation of the 
original scaling because 45 of the patients 
in the present series were members 
of the original patient sample on 
which the scales were developed. 
Thus, 29% of the current group entered 
into the original determination 
of reproducibility of the scales. 
Rather than eliminate this group of 
patients, it was decided that the maximum 
sample should be kept so that 
the comparison with the factor analysis 
would be as complete as possible. 


TABLE 1 
18-TEST SCALE AND THE CLINICAL SCALE 

Scale            Sample              Coefficient of      Minimum 
                                     reproducibility     reproducibility 
Eighteen test    Total, N = 157      88.3                69.7 
                 Group I, N = 21     87.3                T7 
                 Group II, N = 27    83.1                71.6 
                 Group IV, N = 65    87.5                22:1 
Clinical scale   Total, N = 157      90.0                69.4 
                 Group I, N = 21     90.5                70.8 
                 Group II, N = 27    86.2                69.9 
                 Group IV, N = 65    88.6                69.6 

Next, the tests making up the scales 
were examined with reference to the 
findings of the factor analysis of 69 
tests from the full aphasic battery for 
the same sample of patients. 


Re-examination of the Scales 


Subjects were given total scores consisting 
of the number of tests they [...] 
[...]tion of individual passes and fails was 
plotted. The coefficient of reproducibility 
for the matrix was then determined, 
[...] the values fou[...] 
and, with the [...] 
for Group II, [...] 
accepted level taken to demonstrate 
the existence of a homogeneous scale. 
As was pointed out in the original 
study, the lowered efficiency of the 
scales in Group II may be attributed to 
the fact that this group has visual 
losses ranging from mild to severe 
in addition to language loss, and thus 
scores more poorly on tests which 
make visual demands in addition to 
language demands than the language 
component of the tests alone would 
suggest. 

Overall, in spite of the complex patterns 
of visual, perceptual, motor, and 
sensorimotor losses which frequently 
accompany aphasia, it appears that the 
severity of language loss itself is scalable, 
and may be thought of as a relatively 
homogeneous continuum. 
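
For readers unfamiliar with the two indices compared in Table 1, the sketch below shows one common way of computing a coefficient of reproducibility and the minimum (marginal) reproducibility from a pass-fail matrix. It is an assumption-laden illustration: the function name is hypothetical, and errors are counted as deviations from the ideal pattern implied by each subject's total score, which may differ from the error-counting convention used in the original scaling.

```python
def reproducibility(matrix):
    """Return (coefficient of reproducibility, minimum marginal reproducibility).

    matrix: list of rows, one per subject, each a list of 0/1 item results.
    """
    n_subjects = len(matrix)
    n_items = len(matrix[0])
    passes = [sum(row[j] for row in matrix) for j in range(n_items)]
    # Order items from easiest (most passes) to hardest.
    order = sorted(range(n_items), key=lambda j: -passes[j])

    errors = 0
    for row in matrix:
        total = sum(row)
        for rank, j in enumerate(order):
            # A perfect scale predicts passes on the `total` easiest items only.
            predicted = 1 if rank < total else 0
            if row[j] != predicted:
                errors += 1

    cells = n_subjects * n_items
    cr = 1.0 - errors / cells
    # Best reproduction achievable from item marginals alone.
    mmr = sum(max(p, n_subjects - p) for p in passes) / cells
    return cr, mmr
```

A scale is usually taken seriously only when the first value clearly exceeds the second, which is the comparison the table reports.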


Relation of the Scales to Factor 
Analytic Findings 


The factor analysis, reported in de- 
tail in Schuell, Jenkins, and Carroll 
(1962), began with a 69 × 69 matrix 
of tetrachoric correlations between the 
tests in the battery. The matrix was 
subjected to a principal components 
analysis. Eight principal axes exhausted 
99% of the variance to be accounted 
for. These eight axes were 
used as the basis for subsequent rotations. 
Two analytic rotations were 
performed by the computer, an orthogonal 
solution (Varimax) and an 
oblique solution (Oblimin), and 
graphical solutions were also obtained. 
Analyses resulted in virtually 
identical factor structures showing 
five major factors and two minor factors 
identified by only one or two tests. 
The Oblimin analysis showed the major 
factors to be moderately intercorrelated 
with coefficients ranging from 
.14 to .61. 


TABLE 2 

FACTOR LOADINGS OF 18 TESTS ON PRINCIPAL COMPONENTS AND VARIMAX ANALYSES 
OF THE APHASIA TEST BATTERY 

Test   % passing   Principal axes               Varimax factors 
                     1    2    3    4    5        1    2    3    4    5 
 1*      66.9       82  -13   32  -34   10       49   38   40   03   61 
 2       38.9       91   10   09   07  -06       66   58   19   16   22 
 3       35.7       91   19   24   02  -14       71   59   16  -01   26 
 4       33.8       79  -01  -09  -12  -25       59   33   34   21   20 
 5       14.0       74   13  -05  -04  -14       61   36   19   17   13 
 6       66.2       81  -25   17  -26  -04       45   35   54   07   43 
 7*      34.4       87   20  -30   08   02       82   20   22   34   05 
 8*      44.6       88   38  -08  -14  -03       93   22   17   07   13 
 9       46.5       90   34  -13   02   09       91   28   19   12  -05 
10*      41.4       83   49  -08  -19   00       96   16   06   07   14 
11*      34.4       90   34   07  -17   14       91   26   20   00   19 
12       19.7       82   24  -31   05   07       84   17   20   22  -07 
13       47.1       89   39   00  -12   02       91   29   10   06   18 
14*      24.8       87   33  -16  -07   09       90   21   19   16   00 
15       19.1       87   38  -09  -16   15       93   17   14   11   16 
16*      12.1       88   39  -03  -07   11       91   27   07   08   16 
17       23.6       88  -10   13   20  -15       49   73   31   18   10 
18        3.2       92   05   12   26  -02       64   68   29   07  -08 

* Seven tests on the clinical scale. 

The factors may be briefly described 
as follows: 


Factor 1. Language behavior. 
Tests involving language functions in 
all modalities showed high loadings on 
this factor. The factor accounted for 
41% of the variance in the battery. 

Factor 2. Visual discrimination, 
recognition and recall. Reading and 
writing tests obtained high loadings 
on this factor. In the oblique analysis 
it was found to correlate most highly 
with Factor 1 (r=.61). 


Factor 3. Visuo-spatial behavior. 
This factor involved both visual and 
sensory processes and correlated most 
highly with Factor 2 (r=.55) and 
Factor 4 (r = .58). 

Factor 4. Gross movement of the 
speech musculature. This factor ap- 
peared to reflect paralysis or paresis 
of the speech musculature. It in- 
cluded difficulty in swallowing as well 
as in imitating movements of tongue 
and jaw, and palatal involvement. 

Factor 5. Recognition of stimulus 
equivalence. This factor was consid- 
ered to reflect integrative processes 
involved in taking account of stimu- 
lus equivalence, both within and across 
language modalities. 


Overall, it may be seen that a pat- 
tern of related factors with a domi- 
nant central language factor charac- 
terizes the relations between the test 
scores. This outcome is clearly what 
would be expected on the hypothesis 
of a major dimension of language 
deficit in aphasia. 
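
What a "dominant" first principal axis means can be made concrete with a small sketch. The code below is not the authors' computation: it is a hypothetical illustration that extracts only the first principal axis of a correlation matrix by power iteration and reports the loadings and the share of total variance that axis accounts for. It assumes a matrix of mostly positive correlations, so that the uniform starting vector is not orthogonal to the leading axis.

```python
import math

def first_principal_axis(corr, iterations=200):
    """Return (loadings on the first principal axis, share of total variance)."""
    n = len(corr)
    v = [1.0 / math.sqrt(n)] * n  # uniform unit-length starting vector
    for _ in range(iterations):
        w = [sum(corr[i][j] * v[j] for j in range(n)) for i in range(n)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    # Rayleigh quotient gives the eigenvalue for the converged unit vector.
    eigenvalue = sum(v[i] * sum(corr[i][j] * v[j] for j in range(n))
                     for i in range(n))
    loadings = [math.sqrt(eigenvalue) * x for x in v]
    share = eigenvalue / n  # the trace of a correlation matrix equals n
    return loadings, share


# Hypothetical three-test correlation matrix, for illustration only.
toy = [[1.0, 0.5, 0.5],
       [0.5, 1.0, 0.5],
       [0.5, 0.5, 1.0]]
toy_loadings, toy_share = first_principal_axis(toy)
```

When every test correlates substantially with every other, as in the toy matrix, all tests load highly on the first axis and that axis takes the largest share of the variance.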

Table 2 reports the loadings of the 
18 tests used in the 
scales on the first five principal axes 
and on the five major factors on the 
Varimax rotation. 

Study of the table shows that the 
relation of the scaled tests to the 
factor structure is somewhat like that 
found by Gage (1947) in his attitude 
study. Gage developed a homogeneous 
scale and performed a factor analysis 
on a pool of attitude items. He found 
that items which proved to be scalable 
were those which had the highest 
loadings on the first unrotated factor 
and on the first rotated factor. Items 
which had lower loadings than the 
scaled items on his unrotated first 
factor tended to shift off to other fac- 
tors in the rotation. 

With respect to the principal com- 
ponents in the present study, it is ap- 
parent that all of the tests in the scale 
have very high loadings on the first 
axis (median loading .87) and very 
small loadings on the remaining axes. 
The tests are heavily saturated with 
whatever is common to the entire battery 
[...] the 18-test homogeneous scale 
[...] heavily saturated [...] 
the rotated first factor which 
we have called general language. The [...] 
tests on the [...] 
factors are 83, [...] 
most heavily elsewhere. 

In the entire battery of 69 tests, 
there are 16 tests which load above 
.80 on the general language factor. 
Of these, 10 are included in the 18- 
test scale and six of the 10 appear in 
the seven-test clinical scale. Simi- 
larly in the entire battery there 
are 10 tests with loadings of .90 or 
above on the general language fac- 
tor. Of these, eight appear in the 
18-test scale and five of them appear 
on the clinical scale. 

The four tests on the 18-test scale 
with the lowest loadings on the gen- 
eral factor appear to be at the ends 
of the difficulty distribution; they consist 
of the two easiest tests of the 18, 
the most difficult test, and the sixth 
most difficult test. This suggests the 
reasonable conclusion that the items 
making up a Guttman scale may well 
be weak with respect to homogeneity 
of the scale when they are at the 
extremes of the distribution, without 
being detected by the procedures of 
scale construction. However, the pres- 
ent findings may also be due to the 
fact that the pass-fail points are different 
for the scale analysis and the 
factor analysis, as specified above. 
Thus, in some of the extreme tests 
in the scale, we may have been deal- 
ing with very poor or excellent per- 
formance in which language was most 


tected other components of perform- 
ance present in the mid-difficulty range 


Discussion 
The results given above indicate 
that the scales which were developed 
[...] study perform adequately 
[...] acceptance of any simple notion of 
aphasia as varying only in amount. 
As we pointed out earlier (Schuell 
& Jenkins, 1959) the degree of general 
language deficit does not by itself 
furnish an adequate diagnosis, suggest 
effective modes of treatment, or 
predict the extent of recovery. What 
is shown by these studies is that 
simple expressive-receptive, motor-sensory, 
or other elementary classifications 
of aphasia are inadequate. There 
is a general language deficit in aphasia 
which is nonspecific to modalities, 
[...] or functions. This general deficit 
must be recognized in any description 
of the disorder. Students of aphasia 
since Weisenberg and McBride 
(1935) have come increasingly to this 
conclusion, but the level of acceptance 
of this basic finding by neurologists 
and psychologists has lagged far behind. 
Findings of our current research 
program argue overwhelmingly 
for recognition of the existence of a 
dominant general language deficit in 
aphasia. 

Re-examination of the homogeneous 
scales of language deficit reveals that 
they perform consistently as expected. 
When examined in the light of a factor 
analysis of the entire aphasia test 
battery, the tests used in the homogeneous 
scales proved to be heavily 
saturated with the first principal component 
of the battery and with the 
first rotated factor, general language. 
Tests at the extremes of difficulty of 
the homogeneous scales appear to be 
less reliably related to the general 
language factor obtained on the factor 
analysis than tests at the midrange of 
difficulty. Results are consistent with 
the hypothesis of a dominant dimension 
of general language deficit in 
aphasia. 


REFERENCES 


Gage, N. L. Scaling and factorial design 
in opinion poll analysis. Purdue Univer. 
Stud. higher Educ., 1947, 61. 

Jones, L. V., & Wepman, J. M. Dimensions 
of language performance in aphasia. 
J. speech hear. Res., 1961, 4, 220-232. 

Luria, A. R. Brain disorders and language 
analysis. Lang. Speech, 1958, 1, 14-34. 

Schuell, Hildred. The Minnesota Test 
for Differential Diagnosis of Aphasia. 
(Rev. ed.) Minneapolis: University of 
Minnesota Printing Department, 1955. 

Schuell, Hildred, & Jenkins, J. J. The 
nature of language deficit in aphasia. 
Psychol. Rev., 1959, 66, 45-67. 

Schuell, Hildred, & Jenkins, J. J. Comment 
on dimensions of language performance 
in aphasia. J. speech hear. Res., 
1961, 4, 295-299. 

Schuell, Hildred, Jenkins, J. J., & Carroll, 
J. B. A factor analysis of the Minnesota 
Test for Differential Diagnosis of 
Aphasia. J. speech hear. Res., 1962, 5, 
349-369. 

Weisenberg, T., & McBride, Katharine. 
Aphasia, a clinical and psychological study. 
New York: Commonwealth Fund, 1935. 


(Received January 31, 1963) 


. Ne. . 94-103 




THE BLANK TRIALS LAW: 


THE EQUIVALENCE OF POSITIVE REINFORCEMENT 
AND NONREINFORCEMENT 


M. LEVINE, H. LEITENBERG, AND M. RICHTER 


Indiana University 


A theorem equating behavior after E says “right” with behavior 
after blank trials 


Levine (1963) has recently described 
a model of hypothesis behavior 
by humans trying to solve simple discrimination 
problems. The model 
compares behavior during two types 
of problems: Outcome and Nonoutcome 
problems. An Outcome problem 
refers to the standard discrimination 
procedure in which stimuli are 
presented for a number of trials, the 
subject must make one of two choice 
responses after each stimulus presentation, 
and the experimenter presents 
an Outcome, e.g., says “right” or 
“wrong” after 
[...] outcome problem follows exactly the 
[...] outcomes [...] 
that the probability of the subjects 
repeating their response from Trial 1 
to Trial 2 during Non[...] 
response during Outcome problems 
[...] following the first 
trial is a “right.” In mathematical 
form (letting R = repeated response, 
N = Nonoutcome problem, O = Outcome 
problem, + = “Right” after 
first response): 


P(R | N) = P(R | O+) 


Another way of stating this is: During 
a Nonoutcome problem the subject 
behaves as though the experimenter 
were saying “right.” 


It is becoming conventional (Koehler, 
1961) to refer to trials with [...] 
However, [...] consisting on[...] 
referred to as a [...] Problem. 

Whereas the theorem was derived 
from a model describing behavior only 
over the first two trials of a discrimination 
problem, it is apparent that the 
theorem might have greater generality. 
That is, although the model is 
not developed to describe behavior 
throughout, e.g., a six-trial Outcome 
problem, one can still inquire as to the 
applicability to such a problem of the 
statement that “during a Nonoutcome 
problem the subject behaves as though 
the experimenter were saying ‘right.’” 
The theorem was evaluated in four 
experiments. The first, which was intended 
as a direct test of the model, 
consisted in a series of discrimination 
problems exactly two trials in 
length. The other three experiments 
were tests simply of the generality 
of the theorem. They consisted in a 
series of modified double-alternation 
problems, a series of contingent-discrimination 
problems, and a single 40-trial 
guessing problem. 


EXPERIMENT I: TWO-TRIAL DISCRIMINATION 
PROBLEMS 


This experiment has been described 
in detail elsewhere (Levine, 1963) 
and will be summarized only briefly 
here. A learning-set experiment con- 
sisting of 90 two-trial problems was 
presented under four experimental 
conditions to four groups of 20 sub- 
jects. The four conditions differed 
along a continuum defined by the ex- 
perimenter’s manner of reinforcement. 
At one end of this continuum is the 
standard learning-set procedure in 
which experimenter selects the stimu- 
lus to be correct, and reinforces re- 
sponses only to it on each trial of the 
problem. This procedure holds for 
every problem in the experiment. That 
is, the correct stimulus is the same 
for both trials of 100% of the prob- 
lems. The group receiving this con- 
dition will be referred to as G-100. 

At the other end of the continuum 
is the alternation learning-set pro- 
cedure in which the stimulus desig- 
nated as correct by experimenter alter- 
nates from Trial 1 to Trial 2. 

The correct stimulus is never the 
same for the two trials of any prob- 
lem, or, conversely, is the same for 
0% of the problems. The group re- 
ceiving this condition will be referred 
to as G-0. In between the two extremes 
experimenter may follow one 
procedure or the other for as many 
problems as he wishes. For the two 
remaining conditions the correct stimulus 
was the same during 80% of the 
problems, changing during the remainder 
(G-80), and the correct 
stimulus was the same during 20% 
of the problems, changing during the 
remainder (G-20). The four groups, 


then, were G-100, G-80, G-20, and 
G-0, where the number describes the 
percentage of problems on which ex- 
perimenter caused the correct stimulus 
to perseverate. 


Method 


Apparatus. The stimuli were 180 differ- 
ent three-letter nonsense syllables selected 
from Glaze’s (1928) lists. Pairs of syllables 
were randomly selected to form the stimu- 
lus objects for a given trial, and were typed 
5 inch apart on a 3 × 5 card. A pair of 
cards containing the same two syllables 
served as the materials for a problem. A 
deck of 90 such pairs of cards was constructed. 

Procedure. Each subject was shown a 
sample 3 × 5 card containing two syllables 
and was instructed that he would receive 
a deck composed of similar cards, that he 
was to proceed through the deck one card 
at a time, and that he was always to choose 
one of the two syllables on each card. He 
was further told that the experimenter 
would say “right” or “wrong” after each 
choice, and that he was to try to be right 
as often as possible. These will hereafter 
be referred to as “standard instructions.” 

After either 5 or 10 Outcome problems, 
the experimenter stopped the subject and 
instructed him that there would now be a 
test of how much had been learned thus 
far. The subject was told that during the 
next few cards the experimenter would not 
say anything, that because this was a test 
he was to try to get 100% correct. The 
subject then proceeded to choose syllables 
on the next 10 cards (5 problems) with 
the experimenter saying nothing. After 
these 5 problems the experimenter an- 
nounced that the learning would be re- 
sumed and presented outcomes during the 
next 10 problems. Test instructions then 
followed, etc. Ten Outcome problems con- 
tinued to alternate with 5 Nonoutcome prob- 
lems until 90 problems—60 with, 30 with- 
out outcomes—had been presented. 
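
The alternation of Outcome and Nonoutcome blocks can be laid out explicitly. The sketch below is a hypothetical reconstruction, assuming the session opens with a full block of 10 Outcome problems (the text allows the first block to be either 5 or 10); the function name and the "O"/"N" labels are illustrative only.

```python
def build_schedule(total=90, outcome_block=10, blank_block=5):
    """List of problem types: 'O' = Outcome problem, 'N' = Nonoutcome problem."""
    schedule = []
    while len(schedule) < total:
        schedule += ["O"] * outcome_block + ["N"] * blank_block
    return schedule[:total]


schedule = build_schedule()
n_outcome = schedule.count("O")     # problems with "right"/"wrong" feedback
n_nonoutcome = schedule.count("N")  # test problems with no feedback
```

Under this assumption the schedule contains 60 Outcome and 30 Nonoutcome problems, in agreement with the totals stated in the text.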


Results 


The proportion of Nonoutcome 
problems during which the subjects 
selected the same syllable on both 
trials is compared in Table 1 to the 
corresponding proportion during those 
Outcome problems on which the experimenter 
said “right” after the first 
response. 


TABLE 1 

THE PROPORTION OF PROBLEMS IN WHICH THE 
RESPONSES WERE THE SAME ON BOTH TRIALS 

               Group 
               G-0    G-20   G-80   G-100 
P(R | N)       24     43     81     903 
P(R | O+)      28     40     83     92 

The results in this table are typical 
of those found by more detailed 
analyses: there is little difference be- 
tween the corresponding pairs of 
proportions. The differences that 
were obtained were not significant. 
For the four conditions described by 
the four groups the theorem is clearly 
tenable. 


EXPERIMENT II: MODIFIED DOUBLE-ALTERNATION 


In this experiment all subjects re- 
ceived a series of four-trial problems, 
each of which utilized a different pair 
of stimuli. On any Outcome problem 
the experimenter nonsystematically se- 
lected one of the two stimuli to be 
correct on Trial 1; on Trials 2 and 
3 the other stimulus was always cor- 
rect; on Trial 4 the stimulus correct 
on Trial 1 was again correct. In 
short, the correct stimulus followed the 
pattern of abba.* 


This is not what is typically meant by 
“double alternation.” This term is used 
for problems in which the correct stimulus 
follows an aabb pattern. Preliminary 
work showed that this latter problem series 
was too easy for college subjects to yield 
a varied distribution of response sequences. 
The abba pattern is the double-alternation 


On special test problems the group 
of subjects was subdivided, one sub- 
group receiving a Nonoutcome prob- 
lem (experimenter says nothing during 
all four trials), the other subgroup 
receiving a (4+) problem (experi- 
menter says “right” after all four 
trials no matter which of the two re- 
sponses subject makes). The theorem 
says that the distribution of obtained 
response sequences should be identical 
under these two conditions. 

This comparison is the critical test 
of the theorem. It is possible, how- 
ever, that the same distribution of re- 
sponse patterns would appear no mat- 
ter what outcomes were presented. In 
order to check on this possibility the 
distribution obtained during a Non- 
outcome problem was occasionally 
compared to the distribution produced 
when the experimenter said “wrong” 
on a prescheduled basis. 


Method 


Subjects. The subjects were 100 students 
from the introductory psychology courses 
at Indiana University. 

Apparatus. The stimuli were 100 dif- 
ferent three-letter nonsense syllables selected 
from the 80%-93% sections of Glaze’s 
(1928) lists. Pairs of syllables were 
randomly selected without replacement to 
form the stimuli for a given problem and 
were typed on four 3 × 5 cards. The two 
syllables were typed in capital letters 1.5 
inches apart on the horizontal center line 
of each of the four cards. The positions 
of the syllables were randomized, with only 
the restriction that the syllables did not 
maintain the same positions on all four 
cards. Four such cards constituted the ma- 
terials for a problem; a deck of 50 four- 
card sets constituted the materials for the 
experiment. 

Procedure. The subject received the 
standard instructions described in Experi- 
ment I, was then given the deck, and was 
instructed to begin. After each trial the subject 
placed the card face down. 


pattern started one step later, hence the 
adjective “modified.” 


The correct stimulus on the four trials 
of each of the first eight problems followed 
an abba pattern. The syllable defined as 
A (i.e., correct on Trials 1 and 4 but not 
on Trials 2 and 3) had been randomly selected 
in advance. All subjects were treated 
identically through the eighth problem. On 
Problem 9 the subject received one of two 
treatments depending upon the subgroup to 
which he had been assigned before the 
experiment began. The 50 subjects of Subgroup 
1 now received a (4+) problem. 
Without any announcement or indication 
of change in procedure the experimenter 
said “right” no matter which response the 
subject made during the four trials of this 
problem. The 50 subjects of Subgroup 2 
received first these instructions: 


For these next few cards I am going 
to test you on what you have been learning. 
I want to see if you have learned 
enough to get the right word without my 
telling you whether you are right or 
wrong. This is a test of learning so try 
to get 100% correct. 


They then received a Nonoutcome problem 
for the ninth problem; the experimenter 
said nothing during these four trials. 

The subjects’ procedures were reversed 
on the tenth problem, i.e., Subgroup 1 
received the instructions and the Nonoutcome 
problem, and Subgroup 2 received 
the (4+) problem. 

The first 10 problems, then, consisted of 
eight standard Outcome problems of abba 
solution patterns followed by a (4+) and 
Nonoutcome problem on Problems 9 and 
10 for one subgroup and the reverse sequence 
on Problems 9 and 10 for the other. 
In general, the experiment consisted of five 
10-problem series. The first, third, 
and fifth series ended in a (4+) and Nonoutcome 
problem; the second and fourth 
series ended slightly differently. Instead 
of a (4+) procedure for the ninth (or 
tenth) problem a (1-, 3+) procedure was 
used. In this, experimenter says “wrong” 
on the first trial and “right” on the 
remaining three trials. 


FIG. 1. Distribution of response patterns 
during (4+) and Nonoutcome problems 
(left-hand graph), and during (1-, 3+) 
and Nonoutcome problems (right-hand 
graph). (Ordinate: percentage of problems.) 


There are eight response sequences 
which a subject may manifest on any 
problem. For example, a subject may 
single-alternate his selection of syllables 
for the four trials (i.e., show 
abab), he may double-alternate (show 
aabb), perseverate (aaaa), etc. For 
each of the three types of problems the 
frequency distribution of the eight re- 
sponse patterns may be plotted and 
compared. The theorem says that the 
distribution from the Nonoutcome 
problems should be the same as that 
from the (4+) problems. It does not 
specify the relation of either of these 
to the distribution from the (1—, 
3+) problems, but it implies that the 
latter should be different. The com- 
parisons are made in Figure 1. The 
left side of Figure 1 shows the com- 
parison of data from Nonoutcome and 
(4+) problems from the first, third, 
and fifth series combined. It will be 
seen that the match is exceedingly 
close: the only significant difference 
is for pattern aaab (p = .04 by a t 
test for differences between proportions). 
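
Why there are exactly eight response sequences, and how a raw sequence of choices is reduced to one of them, can be shown in a few lines. The sketch below is illustrative only: the function names and the nonsense syllables in the example are hypothetical, and the coding rule (the first choice is always labeled a, the other syllable b) is an assumption consistent with the patterns named in the text.

```python
from collections import Counter
from itertools import product

# The first choice is always coded "a", so only Trials 2-4 vary: 2**3 = 8 patterns.
ALL_PATTERNS = ["a" + "".join(rest) for rest in product("ab", repeat=3)]

def pattern(choices):
    """Recode four successive choices relative to the first one chosen."""
    first = choices[0]
    return "".join("a" if choice == first else "b" for choice in choices)

def distribution(problems):
    """Proportion of problems showing each of the eight response patterns."""
    counts = Counter(pattern(p) for p in problems)
    return {p: counts[p] / len(problems) for p in ALL_PATTERNS}


# Hypothetical data: three four-trial problems from one subject.
example = [["KEX", "ZOJ", "ZOJ", "KEX"],   # abba
           ["MIB", "MIB", "MIB", "MIB"],   # aaaa
           ["VUD", "QAF", "VUD", "QAF"]]   # abab
example_distribution = distribution(example)
```

Two such distributions, one from Nonoutcome problems and one from (4+) problems, are what the theorem predicts should coincide.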

The similarity of this pair of dis- 
tributions is emphasized by the con- 
trast in the other two distributions. 
The right side of Figure 1 compares 
the distributions from Nonoutcome and 
(1-, 3+) problems from the second 
and fourth series combined. The 
theorem implies that these distributions 
should not be alike, and they are 
clearly different. Three of the differ- 
ences (for patterns abba, aaab, and 
abab) are significant well beyond 
the .01 level. Thus, the similarity 
between the Nonoutcome and (4+) 
distributions is produced by the par- 
ticular affinity of the two outcome 
operations involved (saying nothing 
versus saying “right”) rather than by 
any stereotyped response sets by the 
subjects. 


EXPERIMENT III: CONTINGENT 
DISCRIMINATION * 


In the contingent discrimination ex- 
periment two stimuli are presented, 
one of which has been defined as cor- 
rect by the experimenter. The selec- 
tion of a stimulus as correct depends 
(is contingent) upon the value of some 
other dimension. For example, the 
background upon which the stimuli 
are placed may be one of two colors 
(e.g., red or blue), with the following 
rule defining correctness: if the back- 
ground is red then Stimulus A is 
correct; if the background is blue 
then Stimulus B is correct. This re- 
lationship holds throughout the prob- 
lem. The subjects received a series 
of such problems, each problem last- 
ing for exactly four trials. 


Method 


Subjects. The subjects were 120 students 
from the introductory psychology courses 
at Indiana University. 

Apparatus. Seventy-two different three- 
letter nonsense syllables (selected from the 
80%-93% association sections of Glaze’s 
lists, 1928) formed the stimuli for 36 
problems. A problem was constructed by
selecting two syllables from this set and
typing them on four cards, two of which
were one color and two of which were a
different color. The cards were selected
from six colored sets of 3 × 5 cards
(colored blue, green, yellow, orange, red,
or white). The two colors employed for
a given problem were randomly selected
with the restriction that the same color not
appear in two successive problems.


3 We are grateful to Mymon Goldstein
for suggesting this problem as suitable for
evaluating the generality of the theorem.

The two syllables were typed 1.5 inches 
apart in capital letters on the horizontal 
center line of the card. The position of 
the syllables was randomized with each 
syllable appearing twice on the left and 
twice on the right. The card colors either 
single-alternated or double-alternated over 
the four-trial problem. 

Thirty-six four-trial problems, each 
utilizing two new nonsense syllables and 
two colors, were constructed and put into 
sequences forming a deck of 144 cards. 
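
The construction constraints stated above can be sketched in code. This is a minimal illustration, not the authors' procedure: the seeded random generator, the placeholder syllable labels, the dictionary layout of a card, and the way positions are shuffled are all assumptions made for the example.

```python
import random

COLORS = ["blue", "green", "yellow", "orange", "red", "white"]


def build_deck(syllables, n_problems=36, seed=0):
    """Build a deck of four-trial problems under the constraints in the text."""
    rng = random.Random(seed)
    pool = list(syllables)
    rng.shuffle(pool)
    deck, previous_colors = [], set()
    for p in range(n_problems):
        first, second = pool[2 * p], pool[2 * p + 1]  # two new syllables per problem
        # No color may appear in two successive problems.
        allowed = [c for c in COLORS if c not in previous_colors]
        color_a, color_b = rng.sample(allowed, 2)
        # Colors either single-alternate (ABAB) or double-alternate (AABB).
        order = rng.choice([[0, 1, 0, 1], [0, 0, 1, 1]])
        colors = [(color_a, color_b)[i] for i in order]
        # Each syllable appears twice on the left and twice on the right.
        first_on_left = [True, True, False, False]
        rng.shuffle(first_on_left)
        for color, left in zip(colors, first_on_left):
            pair = (first, second) if left else (second, first)
            deck.append({"problem": p, "color": color, "left": pair[0], "right": pair[1]})
        previous_colors = {color_a, color_b}
    return deck


# Placeholder syllable labels; the real items came from Glaze's lists.
deck = build_deck([f"SYL{i:02d}" for i in range(72)])
```

With 72 syllables and 36 problems of four cards each, the sketch yields the 144-card deck described in the text.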

Procedure. The subject received the 
standard instructions described in Experiment I.
He was then given the deck and
was instructed to begin. After each trial 
the subject placed the card face down. 

All subjects received five four-trial Out- 
come problems in which the correct non- 
sense syllable was always contingent upon 
the color of the card. During the sixth 
problem the subject received one of three 
different conditions according to the sub- 
group to which he had been assigned. One 
condition was a Nonoutcome problem. At 
the end of Problem 5 these subjects were 
instructed as follows: 


For these next few cards I am going to 
test you on what you have been learning. 
I want to see if you have learned enough 
to get the right word without my telling 
you whether you are right or wrong. 


Problem 6 was then presented with the ex- 
perimenter saying nothing. The second 
condition was a (4+) problem. Without 
any announcement or change in procedure 
the experimenter said “right” no matter 
which response these subjects made during 
the four trials of Problem 6. The third 
condition was a (1—, 3+) problem. The 
experimenter said “wrong” to the first re- 
sponse followed by three “rights” to the 
following responses no matter which re- 
sponse these subjects made during the four 
trials of Problem 6. 

The first six problems, then, consisted
of five Outcome problems of a contingent
discrimination type followed by a test
problem which could be either a Nonoutcome,
a (4+), or a (1-, 3+) problem.
The entire experiment consisted of six cycles
of such a six-problem series. Each subject
therefore received six test problems. During
the first three and again during the
last three test problems, he received each
of the three test conditions. The conditions
were organized for every three subjects to
form a Latin square with Tests 1, 2, and 3,
and again with Tests 4, 5, and 6.
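
One way to picture the counterbalancing is the following sketch. It assumes a cyclic Latin square and assumes that each subject's order for Tests 4-6 repeats his order for Tests 1-3; the text specifies only that a Latin square was formed for each block of three tests, so both choices are illustrative.

```python
CONDITIONS = ["Nonoutcome", "(4+)", "(1-, 3+)"]


def latin_square_orders(conditions):
    """Cyclic Latin square: row i is the condition order for subject i."""
    n = len(conditions)
    return [[conditions[(row + col) % n] for col in range(n)] for row in range(n)]


def schedule_for(subject_index):
    """Six test conditions for one subject: one square row for Tests 1-3, again for Tests 4-6."""
    row = latin_square_orders(CONDITIONS)[subject_index % 3]
    return row + row  # assumption: the same row is reused in the second block
```

Under this arrangement every subject meets each condition twice, and within any group of three subjects each test position carries all three conditions.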


Results 


For each of the three types of test 
problems the frequency distribution of 
the eight response patterns may be 
plotted and compared. These com- 
parisons are made in Figure 2. This 
figure shows three frequency polygons 
representing the relative frequency 
distributions of the response patterns 
for the three conditions. The data 
are combined for all 120 subjects on 
the two test problems of a given 
condition. Each curve, therefore, is 
based upon 240 problems. It will be 
seen that as specified by the theorem, 
the match between the Nonoutcome 
condition and the (4+) condition is 
exceedingly close. It is equally clear 
that, again, the (1—, 3+) condition 
yields an entirely different type of 
distribution. 
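
The tabulation behind such a figure can be sketched as follows. The sketch assumes that a response sequence is coded relative to the first response (the first response is labeled "a", the other "b"), which is consistent with pattern names such as aaab and abab used earlier; the example responses are made up for illustration and are not data from the experiment.

```python
from collections import Counter
from itertools import product


def canonical_pattern(responses):
    """Relabel a response sequence so the first response is 'a' and the other is 'b'."""
    first = responses[0]
    return "".join("a" if r == first else "b" for r in responses)


# All eight patterns possible over four two-choice trials.
ALL_PATTERNS = ["a" + "".join(rest) for rest in product("ab", repeat=3)]


def pattern_distribution(problems):
    """Relative frequency of each of the eight patterns across a set of problems."""
    counts = Counter(canonical_pattern(p) for p in problems)
    total = len(problems)
    return {pat: counts[pat] / total for pat in ALL_PATTERNS}


# Made-up responses for illustration only (not data from the experiment).
example = [["L", "L", "R", "R"], ["R", "R", "L", "L"], ["L", "R", "L", "R"], ["L", "L", "L", "L"]]
dist = pattern_distribution(example)
```

Comparing two conditions then amounts to comparing two such dictionaries pattern by pattern.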


EXPERIMENT IV: THE GUESSING 
PROBLEM 


Thus far, the investigations of the 
theorem have utilized the learning-set 
procedure, i.e., the presentation of a 
train of similar problems. Also, se- 
quences of only two (Experiment I) 
or four (Experiments II and III) 
test trials were used. It is possible 
that difficulties might arise with larger 
numbers of test trials. For example, 
subjects receiving “blanks” for a longer 
series of trials may start to manifest 
less well organized behavior. Or, possibly,
subjects receiving all “rights”
may, because of the cumulative reinforcing,
i.e., strengthening, effect,
start to show a stronger tendency
toward repetition of the same response.
The theorem, of course,
specifies that neither “blanks” nor
“rights” should have a unique effect
or, conversely, that both should yield
the same behavior. In order to put
the theorem to a more stringent test,
as well as to extend its generality, the
next application was over 10 trials embedded
within a single problem.


Fig. 2. Percentage of problems on which
the eight response sequences occurred during
each of the three conditions. (The response
sequences are ordered according to
the frequency during the Nonoutcome
problems.)

The problem selected was a varia- 
tion of the Humphreys (1939) guess- 
ing problem. The subject was re- 
quired to make one of two responses 
over a series of 40 trials. The stimu- 
lus situation was constant on all trials, 
i.e., there were no variations in stimuli 
from trial to trial upon which subjects 
could make responses contingent. There 
was, however, a contingency upon the 
preceding outcome. Two contingencies 
were employed. In one, the probability
that the response correct on
Trial n was the same as that correct
on Trial n-1 was .8; in the other, that
probability was .2. The former tended
to produce long runs of similar correct
events; the latter tended to produce
short runs, i.e., alternations, of the
correct event.
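
The two contingencies can be illustrated with a small simulation. This is a sketch under stated assumptions: events are coded 0 and 1, the first event is chosen at random, and a seeded generator is used; none of these details is given in the text.

```python
import random


def correct_event_sequence(p_repeat, n_trials=30, seed=0):
    """Generate correct events (0 or 1); each repeats the previous one with probability p_repeat."""
    rng = random.Random(seed)
    events = [rng.randint(0, 1)]
    for _ in range(n_trials - 1):
        same = rng.random() < p_repeat
        events.append(events[-1] if same else 1 - events[-1])
    return events


def mean_run_length(events):
    """Average length of runs of identical consecutive events."""
    runs = 1 + sum(1 for a, b in zip(events, events[1:]) if a != b)
    return len(events) / runs
```

For a long sequence the expected run length is 1 / (1 - p_repeat), i.e., about 5 events under the .8 contingency and about 1.25 under the .2 contingency, which is the sense in which the former yields long runs and the latter yields alternations.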

The subject received one of these 
procedures for 30 trials. This was 
followed either by 10 blank trials (ex- 
perimenter always said nothing) or by 
(10+) trials (experimenter always 
said “right” regardless of subject’s re- 
sponses). The patterns of respond- 
ing obtained under the blank and the 
(10+) conditions should be com- 
parable, according to the theorem. 


Method 


Subjects. The 120 subjects of Experiment III
also served here. They received
the instructions for this experiment follow- 
ing their performance in Experiment III. 

Apparatus. Two sets of 40 cards were 
constructed. In one set the nonsense syllables
CED and NIF were typed on all
40 cards in identical fashion (both syllables 
on the center horizontal line, one inch 
apart, CED always on the left). In the 
other set the syllables pum and por were 
typed in the same manner (rum always 
on the left). 

Procedure. The subjects were divided
into four categories of a 2 × 2 factorial
design. The two variables were (a) the
probability that the same syllable was correct
on two successive trials: this probability
was either .2 or .8 during the first
30 trials; (b) the outcome conditions during
the last 10 trials: the subjects received
either the blank trials or the (10+) trials
during Trials 31-40. Thirty subjects were
assigned to each of the four categories.


Fig. 3. Number of responses on Trial n which are repeats of the responses
on n-1. (The solid lines are for subjects who were told “right” after each response;
dashed lines are for subjects who were told nothing after each response.)

The subject received the standard in- 
structions described in Experiment I. Dur- 
ing Trials 1-30 the experimenter presented 
outcomes after each response. Right and 
wrong responses were defined in advance 
according to the group (.2 or .8) that the 
subject was in. Within a group the se- 
quence of responses defined as correct was 
identical for all subjects. 

After Trial 30 each subject was told, 
“The next few trials will be critical trials, 


For example, all subjects who had re- 
ceived the .8 and the (10+) conditions on 
the first problem, received the .2 and the 
blank conditions on the second problem. 
All other details of the procedure were 
identical to the procedure for the first 
problem.


Results


For Test Trials 2 through 10 the 
frequency with which the subjects made 
the same response as on the preceding 
trial was determined. The results for 
all four conditions of both problems 
are presented in Figure 3. It will be 
seen there that with the exception of a 
couple of places the blank and (10+) 
conditions yield very similar results.4
The overall levels are comparable for 
each of the four pairs of curves, and 
the trial-to-trial trends tend to be 
alike, particularly on the second problem.
Also, the magnitude of the differences
does not seem any greater toward
the end of the 10 trials than at
the beginning. That is, there does 
not appear to be any systematic dif- 
ferential drift produced either by the 
10 “rights” or 10 “blanks” alone. In 
general, the results closely fit the 
theorem. 
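
The dependent measure plotted in Figure 3 can be computed as in the following sketch. The function name and the example sequences are hypothetical; the made-up sequences are for two imaginary subjects and are not experimental data.

```python
def repeats_per_trial(sequences):
    """For Test Trials 2..n, count subjects whose response repeats their preceding response."""
    n_trials = len(sequences[0])
    return {
        trial: sum(1 for seq in sequences if seq[trial - 1] == seq[trial - 2])
        for trial in range(2, n_trials + 1)
    }


# Made-up three-trial sequences for two hypothetical subjects (not experimental data).
demo = repeats_per_trial([["X", "X", "Y"], ["Y", "X", "X"]])
```

Applied to the 10 test trials of each group, this gives nine counts (Trials 2 through 10) per curve.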

One aside is worth recording here. 
All 120 subjects received the condi- 
tion of 10 “rights.” Not one indi- 
cated any suspicions about the 
experimenter. Most, on subsequent 
questioning, expressed the idea that 
they “caught onto the sequence at the
end.”


DISCUSSION 


The finding has been consistently 
confirmed in four differently struc- 
tured experiments that the behavior 
of subjects during trials when no out- 
comes are given is the same as the 
behavior of subjects when the experimenter
says “right” following each response.
Considering the crucial role
attributed to “positive reinforcement”
for altering response probabilities in
contemporary theories of learning it
is strange to find that the absence
of reinforcement seems to yield exactly
the same changes in response probabilities.
“Right” has typically been
treated as “fixating” a response, as
making that response more likely to
occur. Why has this effect not appeared
in these experiments?


4 Trials 5 and 9 of the .8 condition in the
first problem provide the only pairs yielding
significant differences (.01 < p < .05 for
pairs of points). It should be remembered,
however, that 36 comparisons
were made. The probability of finding two
or more significant differences out of 36
tests is well above .05.
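
The multiple-comparisons remark in Footnote 4 can be checked with a short binomial calculation. The sketch assumes that the 36 tests are independent and that each has a .05 chance of reaching significance when no true difference exists; the footnote does not state these assumptions, and the comparisons within a curve are unlikely to be fully independent.

```python
from math import comb


def prob_at_least(k, n, alpha):
    """P(at least k of n independent tests are significant at level alpha) under the null."""
    return 1.0 - sum(comb(n, i) * alpha**i * (1 - alpha) ** (n - i) for i in range(k))


p_two_or_more = prob_at_least(2, 36, 0.05)
```

Under these assumptions the probability comes to roughly .54, in line with the statement that it is well above .05.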

Levine's (1963) model, from which 
the blank trials theorem was derived, 
suggests one solution to this problem. 
One relevant assumption in the model 
is that the unit of behavior directly 
affected by the outcome procedure is 
the subject's hypothesis (a mediating 
process defined as a prediction about 
what constitutes solution behavior) 
rather than his specific choice re- 
sponse. The hypothesis held produces 
either responses to selected cues (e.g.,
to nonsense syllables rather than positions
in the above experiments) or
the sequential organization of choice
responses (e.g., alternation, win-stay-
lose-shift, etc.) or both. The model
also stipulates that each hypothesis is 
manifested in a specific response se- 
quence, and that both “rights” and 
blank trials produce no changes in the 
subject’s hypothesis and, therefore, no 
difference in the sequence of responses 
manifested. In contrast, “wrong” 
causes the subject to alter or reject 
his hypothesis, and thereby produces 
a very different response series. 
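
The logic of this account can be sketched as a toy simulation. It is not Levine's (1963) model itself: the three hypotheses, the rule that a rejected hypothesis is replaced by a randomly chosen different one, and all names are assumptions introduced for illustration.

```python
import random

# Hypothetical hypotheses: each maps (trial index, previous response) to a response.
HYPOTHESES = {
    "always_a": lambda t, prev: "a",
    "always_b": lambda t, prev: "b",
    "alternate": lambda t, prev: "a" if prev is None else ("b" if prev == "a" else "a"),
}


def run_subject(outcomes, start="alternate", seed=0):
    """Responses of a subject who keeps a hypothesis unless told "wrong"."""
    rng = random.Random(seed)
    current, prev, responses = start, None, []
    for t, outcome in enumerate(outcomes):
        response = HYPOTHESES[current](t, prev)
        responses.append(response)
        if outcome == "wrong":
            # Only "wrong" leads the subject to reject the hypothesis and resample.
            current = rng.choice([h for h in HYPOTHESES if h != current])
        prev = response
    return responses


blank = run_subject([None] * 4)            # experimenter says nothing
all_right = run_subject(["right"] * 4)     # a (4+) problem
one_wrong = run_subject(["wrong", "right", "right", "right"])  # a (1-, 3+) problem
```

Because neither a blank nor “right” changes the hypothesis held, the first two runs produce the same response sequence, whereas the initial “wrong” forces a change of hypothesis and hence a different sequence.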

It is worth noting that this view, 
presented here to account for be- 
havior during blank trials, entails two 
ideas of general significance. One is 
that the behavioral unit is a mediating 
process of the sort just described. The 
other is that “right” functions not as 
a strengthener of the choice response 
but as a maintainer of the mediating
process. Both are necessary assumptions
for treating the results of the
experiments described above. With- 
out the first it is difficult to explain 
the development of alternation learn- 
ing sets (Experiments I and II), the 
transfer of these sets to Nonoutcome 
problems (see Levine, 1963, for a de- 
tailed presentation of these argu- 
ments), and the curious fluctuations 
during the 10 test trials of Experiment 
IV. The second assumption, in con- 
junction with the first, accounts for 
the absence of drift toward 100% performance
of one of the two choice
responses when, during the test trials,
the experimenter says only “right.”

Several human learning models incorporate
assumptions sufficiently like
these two that with minor modifica- 
tions they could qualitatively account 
for the present results. These all in- 
voke a mediating process either di- 
rectly as such (Kendler & Kendler, 
1962; Levine, 1963; Restle, 1962), in
terms of selective sampling of cues 
(Bower & Trabasso, 1963), or in 
terms of an observing response (At- 
kinson, 1961). Restle, and Bower 
and Trabasso, make most exactly the 
assumption that positive reinforcement 
maintains the mediating process sam- 
pled (as opposed to strengthening one 
of the two choice responses) but the 
assumption is closely approximated by 
the others. The chief modifications 
necessary in most of these models to 
account for the blank trials law in 
its present context are (a) the expan- 
sion of the mediation concept, which 
is typically a mechanism for cue selection,
to handle sequence learning,
and (b) the addition of the assump- 
tion that during blank trials also the 
subject maintains the same mediating 
process. 

The general theory just sketched accounts
for some puzzling findings from
other experiments. Buss and Buss
(1956), using the Wisconsin Card
Sorting Test, compared the effects of
saying “right” versus “wrong” for
correct and incorrect responses, respectively,
with the effects of saying
nothing versus “wrong” for these two
types of responses. The theorem predicts
no differences between the two
groups and that is what Buss and
Buss found (see also Buss, Braden,
Orgel, & Buss, 1956). More striking,
however, is the fact that in the latter
group almost all the subjects (14 out
of 15) reached the criterion of
consecutive correct responses with the
experimenter never saying a word!
According to the theoretical ideas
elaborated above, the subjects selected
the correct hypothesis at the outset
and the correct behavior pattern
emerged undeterred. One other finding
in this area comes from Buchwald
(1959). He has found that once a
discrimination has been learned following
the usual outcome procedure
(“rights” versus “wrongs”) the
correct response does not extinguish
(over 72 trials) when throughout extinction
the experimenter says nothing.
The blank trials law nicely
predicts these results. A mediation
process model nicely elucidates them.



Anger arousal creates a readiness to act aggressively but suitable
cues—stimuli associated with the present or previous anger instigators 
—presumably are necessary if aggressive responses are actually to 
occur. In cases of hostility displacement, as others have proposed, 
the target evokes aggressive responses from an angered person. 
Aggressive cues in the angered person’s thoughts or in the external 
situation “activate” an aggressive response sequence which does not 
attain completion until the anger instigator is injured. Preventing 
the completion of the activated sequence heightens the arousal state 
and this (a) increases the strength of any subsequent aggressive
responses, and (b) raises the level of tension experienced by the person.
Completing the ongoing aggressive sequence by inflicting injury
often leads, in the absence of guilt or anxiety, to a pleasurable reduction
in the experienced tension.


Few observations in the behavioral 
sciences are as trite as the one boldly 
proclaiming that an individual’s ac- 
tions are a product of both environ- 
mental conditions and internal states. 
But although we are often told that 
B = f(P, E), this proposition is sometimes
unduly neglected by students of
motivation. Even when it is not for- 
gotten, the basic formulation is usually 
only paraphrased, and we are not given 
any specific statement as to the man- 
ner in which environmental stimuli 
and internal states interact to deter- 
mine the direction and intensity of the 
person’s strivings. Such a neglect 
seems to be particularly characteristic 
of writers in the field of aggressive 
behavior. The present paper will at- 
tempt to demonstrate that aggression 
cannot be completely understood without
a consideration of both internal
conditions and environmental stimuli.


1 The present paper is an expanded version
of a talk delivered at the AAAS-
Academy of Psychoanalysis symposium,
“The role of violence in human behavior,”
held in Philadelphia in December 1962.
The author’s research reported here was
conducted under grants M1540, National
Institute of Mental Health, and G23988, National
Science Foundation.

Some preliminary remarks are in 
order. “Aggression” is here defined
as behavior whose goal is the injury 
of some person or object (cf. Dollard, 
Doob, Miller, Mowrer, & Sears, 1939), 
while “anger” is here regarded as the 
emotional state resulting from a frus- 
tration presumably creating a readi- 
ness for aggressive acts. In order to 
simplify this discussion, I might also 
note, comparatively little attention will 
be given to the effects of internal controls
and inhibitions upon these aggressive
responses.

Summarizing briefly, the paper will 
argue that anger and learned habits
separately or together create a readiness
to act in a hostile manner, and
particular cues (stimuli associated
with the present or previous anger
instigators) are necessary if aggressive
responses are actually to occur.
A variety of experimental findings
consistent with this thesis will then
be described, after which we will con- 
sider some of the implications for such 
behavioral phenomena as displacement 
and hostility catharsis. 


AGGRESSIVE CUES 


In the usual commentary on hostile 
behavior the instigation to aggression 
is pictured as an ever-active energy 
source constantly impelling aggressive 
responses. Whether these responses 
appear in overt behavior, however, sup- 
posedly depends primarily on the pres- 
ence of inhibitions or other avenues 
of expression, and not on the stimulus 
qualities of the available targets. Thus, 
once his aggressive “drive” has been 
aroused, whether by appropriate bio- 
logical processes or some frustration, 
a person presumably is constantly in- 
clined to attack someone—anyone— 
unless he can displace his urge onto 
some inanimate object, or divert the 
drive into some socially acceptable 
channel. 

Hartmann, Kris, and Loewenstein 
(1949) provide an excellent illustra- 
tion of this type of thinking. Their 
revision of Freud’s basic doctrine 
posits an innate biological drive based 
on unspecified internal sources. Al- 
though they do state (almost in pass- 
ing) that disliked or frustrating ob- 
jects “invite the discharge” of the 
aggressive energy, these writers place 
particular stress on the “assumption 
of a constant driving power com- 
parable to that of the libido [p. 29].” 
Like libido, aggressive energy sup- 
posedly must find some outlet. The 
urge may be displaced, sublimated, or 
neutralized, but after expression it 
builds up again, and again is said to 
seek release. The drive object is 
theoretically relatively unimportant in 
this process. The object, they main- 
tain (citing Freud), becomes attached 
to the drive “only in consequence of 
being peculiarly fitted to provide satis- 
faction [p. 20].” A particular target 
is attacked not because it has the 
stimulus qualities to evoke aggression 
but because of the satisfactions earned
in the course of aggressing against this
object.

In one way or another, many discussions
of aggressive behavior employ
a somewhat similar analysis, regardless
of whether they see aggression
as instinctual or acquired, the
expression of internal energy or a re- 
action to frustration. As one example, 
the eminent anthropologist, Clyde 
Kluckhohn (1945) suggested that there 
was at least some “freefloating aggres- 
sion” in every society because of the 
deprivations and frustrations suffered 
by the people in it through the course
of their lives. This pent-up force
constantly seeking expression is said 
to be drained off in some societies 
through periodic wars, but other cul- 
tures presumably are able to channel 
the energy into more constructive 
paths such as literature and public
works.2


2 The present analysis fully concurs with
Allport’s (1954, p. 358) criticism of the
“drainage theory” of aggression. Writers
influenced by the orthodox Freudian conception
of aggression have envisioned a
free-floating aggressive energy which may
be channeled into many different activities
(e.g., attempting to master others) or against
a wide variety of objects. Carrying out
any of these substitute actions, or attacking
any of the substitute objects, then supposedly
drains the reservoir of aggressive
energy. Contrary to such reasoning, however,
there is evidence that hostility toward
one group of people is associated positively
with enmity toward other groups. The
ethnocentric individual, we know today, has
learned to dislike a remarkably wide range
of people (Adorno, Frenkel-Brunswik,
Levinson, & Sanford, 1950). He does not
become friendlier toward one outgroup
merely as the result of expressing hostility
towards other such outgroups. The aggressive
action does not in itself weaken his
habitual modes of conduct toward strangers
or people who are different from him.

The habitually hostile individual does not
have a reservoir of aggressive energy
pressing for discharge. Nor, I believe,
does he have a year-long accumulation of
anger pent up within him. His frequent
aggressive outbursts are not emotional explosions
from a storehouse of repressed
energy or anger but, rather, are reactions
evoked by appropriate cues. There may
well be no aggressive responses, either overt
or internal and hidden, if he did not encounter
such stimuli. What this means,
then, is that his customary aggressive behavior
actually is not revealed unless there
are relevant stimuli in the situation confronting
him. Moreover, it is not necessary
to provide substitute activities in order to
lessen the likelihood of aggressive behavior
even if anger is occasionally generated by
frustrations (the blocking of ongoing goal-directed
activity). Unless the thwarted person
is kept aroused or is re-aroused by
being reminded of his frustrations, his anger
probably will dissipate with time, and the
probability of aggressive behavior will decline.


I do not mean to question the
psychoanalytic notion of hostility displacement.
Much of my recent research
has been concerned with this
phenomenon (e.g., Berkowitz, 1962;
Berkowitz & Green, 1962), and I have
little doubt that it exists. My argument
is with another aspect of the
above conceptions. Typically, these
views of aggression assign a very
minor role to stimulation from the
external environment. Such stimulation
theoretically affects only whether
the person believes aggression is safe
or morally justified. For example, in
the usual discussion of the scapegoat
theory of prejudice (e.g., Williams,
1947) the victim is supposedly attacked
primarily because he is a visible and
safe target for pent-up hostility within
the prejudiced individual. He is a
“conveniently chosen outlet,” in the
words of Hartmann and his colleagues,
who is selected largely because the
attacker anticipates little or no punishment
in return from himself or others.
To put this formulation in simpler
terms, the aggressive drive supposedly
“pushes” aggressive acts onto whatever
target happens to be available
and is safe to attack. My own position
is somewhat different: Basically,
I suggest that a target with appropriate
stimulus qualities “pulls” (evokes)
aggressive responses from a person
who is ready to engage in such actions
either because he is angry or
because particular stimuli have acquired
cue value for aggressive responses
from him.3

Such a conception has several im- 
portant implications. For one thing, 
we do not have to think of a person 
as being constantly driven to attack 
someone unless his aggressive im- 
pulses can be diverted into socially 
acceptable actions. His likelihood of 
engaging in hostile behavior can be 
substantially lowered, theoretically at 
least, by removing external stimuli 
capable of evoking aggressive re- 
sponses. 


Similar Conceptions in Other Fields 


The position just outlined is similar 
to the ethological concept of releaser. 
A releaser, or sign stimulus, is a cue 
in the external environment which produces
a given reaction from an organism
ready to make such a response.
To illustrate, Tinbergen (1951) has
reported that an aroused male stickleback
fish reacts with fighting behavior
to the red on the belly of a rival male.
This red stimulus evoking the attack
is highly specific; aggression does not
occur, for example, if the red is on the
rival's back. The evoking stimulus,
furthermore, has little if any effect
if the organism is not in a suitable
condition. In the case of stickleback
aggression the necessary prerequisite
apparently is the production of hormones
associated with reproduction.
Given the required internal state, the
appropriate stimulus releases behavior
the organism is ready to make. There
clearly is an interaction between the
organism’s internal condition and the
external stimulation. Research has
shown that a strong releaser is necessary
to elicit a reaction if the animal
is under a low arousal condition, while
a weaker releaser is sufficient if the
animal is in a stronger motivational
state (Hess, 1962, p. 185).4


3 In their respective discussions of aggression,
both Buss (1961) and the present
writer (1962) independently came to emphasize
the role of external stimulation in
evoking aggressive responses. For Buss
anger “energizes” aggressive behavior and
lowers the “threshold for the occurrence of
aggressive responses [p. 60].” However,
contrary to the present formulation, Buss
does not seem to regard anger as only a
predisposing state, requiring the presence
of some cue functioning as a releaser if
the aggressive behavior is actually to occur.

Clifford Morgan’s (1959) physio- 
logical theory of drive also has im- 
portant similarities to the present 
conception. According to Morgan, 
arousal conditions, such as food de- 
privation or the accumulation of sex 
hormones in the blood, create a moti- 
vational state within the central ner- 
vous system (Central Motive State) 
which predisposes the organisms to 
react in certain ways to particular 
stimuli. Although Morgan suggests 
that some of these motive states may 
lead to specific patterns of behavior 
independently of external stimulation, 
as evidently happens in the case of a 
female cat in heat, he places particu- 
lar stress on the “priming” function 
of the CMS. “Certain reactions are 
preset or primed by drives,” he states, 
“so that they are released when the 
appropriate stimulus is present [p. 
660].”
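
The trade-off between internal state and releaser strength described in this section can be pictured with a simple threshold rule. The multiplicative form, the numerical threshold, and the function names are assumptions made for illustration; they are not proposed by Hess or Morgan, and the sketch ignores the cases Morgan mentions in which a motive state produces behavior without external stimulation.

```python
def response_released(arousal, releaser_strength, threshold=1.0):
    """Hypothetical rule: behavior occurs when arousal and cue strength jointly reach a threshold."""
    return arousal * releaser_strength >= threshold


def minimum_releaser(arousal, threshold=1.0):
    """Weakest cue that would still release the response at a given arousal level."""
    if arousal <= 0:
        return float("inf")  # no cue suffices without the internal state
    return threshold / arousal
```

Under this rule a weakly aroused animal requires a strong releaser and a strongly aroused animal responds to a weak one, while neither factor alone is sufficient.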

4 The ethological concepts of “vacuum
activity” and “displacement,” which attribute
certain activities solely to surplus energy
within neural centers, appear to contradict
the present analysis. However, as the
writer has pointed out elsewhere (1962, p.
20), these conceptions have been questioned
by various authorities (also cf. Hinde, 1960,
pp. 207-208).


For many different kinds of behavior,
then, the response depends
upon the joint operation of both in- 
ternal states and external stimulation. 
Evidence obtained several years ago 
by Campbell and Sheffield (1953) in- 
dicates that even random activity fol- 
lowing prolonged food deprivation re- 
quires external stimulation. Contrary 
to the usual simple notion of an in- 
ternal imbalance driving the organism 
into activity, rats starved for three 
days displayed heightened activity 
only when an environmental change 
was created. The internal condition 
alone did not lead to the random 
behavior. 

For more narrowly defined, focused 
responses rather than general activity, 
more specific stimuli evidently are 
necessary. Research into the effects 
of electrical stimulation of chicken 
brains (von Holst & von Saint Paul, 
1962) suggests that specific situational 
cues are required in evoking aggres- 
sive behavior. In these experiments 
stimulation of a certain region of the 
fowl brain led to organized patterns 
of aggressive behavior primarily when 
relevant cues (“an enemy, real or 
artificial”) were present. An elec- 
trically stimulated rooster would ex- 
hibit “only motor restlessness” when 
“all substitutes for an enemy” were 
lacking. 

In general, a variety of research 
findings suggest that external stimu- 
lation often has a vital role in evoking 
motivated behavior. Particular inter- 
nal states may be necessary, but such 
conditions are not sufficient; in many 
kinds of behavior overt activity does 
not occur unless appropriate situa- 
tional cues are also present to elicit 
the actions. Some internal emotional 
states, probably including anger, result 
only in a general excitement or diffuse 
motor restlessness in the absence of 
such evoking stimuli. To go even further,
for some kinds of physiological
arousal at least, external stimulation
can increase the internal “drive” state 
as well as provide the releasing cues 
(cf. Malmo, 1962, pp. 402-405). 


Aggression-Evoking Stimuli 


Elsewhere (Berkowitz, 1962) I have 
suggested that an object is capable of 
evoking aggressive responses to the 
extent that it is associated with previ- 
ous anger or aggression instigators. 
The association is with the immedi- 
ately preceding frustrater in the case 
of a person who had just been angered, 
and probably is with the people from 
whom aggressive actions had been 
learned in the case of those with ag- 
gressiveness habits. 

Such a formula, of course, is only 
an extension of Neal Miller’s (1948) 
well-known analysis of displacement 
in terms of stimulus-response generali- 
zation. Applying the conditioning 
principles developed by Pavlov and 
others, Miller had proposed that ten- 
dencies—to make or to avoid making 
a given response—originally elicited 
by a particular object generalize to 
similar objects. The strength of this 
generalization is in direct ratio to the 
association between the latter objects 
and the original instigator. Thus, sup- 
pose a man is angered by Person A 
but cannot attack him because A is 
absent from the scene. If the man 
should then encounter other people 
with varying degrees of association 
with Person A, these people will elicit 
hostile reactions from him in propor- 
tion to their association with A. That 
person bearing the strongest associa- 
tion with A will “pull” hostile re- 
sponses which, although not as strong 
as those the original instigator would 
have elicited, are still stronger than the 
responses evoked by people having 
weaker associations with A. (We 
cannot here go into the complications 
introduced by the presence of inhibitions 
against aggression. For a discussion 
of these complexities see Miller, 
1948, and Berkowitz, 1962, pp. 107- 
129.) 

The implications of this model should 
be made explicit. As I said earlier, 
it maintains that the angered individual 
is only aroused, i.e., primed, to commit 
an act of aggression. Whether he does 
perform such an act depends upon, 
among other things, the presence of 
stimuli associated with the anger insti- 
gator. Anger intensity does affect ag- 
gression. The intensity of the aggres- 
sive response evoked by an object 
probably varies directly with both the 
intensity of the anger and the strength 
of the association between the object 
and the original angering agency. 
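
The joint dependence just described can be illustrated with a toy calculation. This is only a sketch under stated assumptions: the text says that response intensity "probably varies directly" with both factors, but the multiplicative form, the 0-to-1 scales, the function name `evoked_aggression`, and the example values are hypothetical and are not taken from the studies cited.

```python
def evoked_aggression(anger: float, association: float) -> float:
    """Hypothetical index of response intensity.

    Assumes (for illustration only) that intensity is the product of
    anger intensity and cue association, each scaled to [0, 1].
    """
    if not (0.0 <= anger <= 1.0 and 0.0 <= association <= 1.0):
        raise ValueError("anger and association must lie in [0, 1]")
    return anger * association


# Hypothetical targets ordered by their association with the instigator (Person A).
targets = {
    "Person A": 1.0,
    "close associate": 0.7,
    "distant associate": 0.3,
    "unrelated stranger": 0.0,
}

anger = 0.8  # illustrative anger level
responses = {name: evoked_aggression(anger, assoc) for name, assoc in targets.items()}

# Rank targets from strongest to weakest evoked response.
ranked = sorted(responses, key=responses.get, reverse=True)
print(ranked)
```

Under these assumptions an unangered person (anger of zero) shows no evoked aggression whatever the cue, and an angered person shows none toward a target with no association, which is the sense in which both conditions are treated as necessary.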

An experiment by Weatherley 
(1962) provides an excellent demon- 
stration of the interaction of cue and 
emotional state in aggressive behavior. 
Female college students whose mothers 
had been either high or low in permis- 
siveness toward aggression were de- 
liberately angered (in one condition), 
or were not provoked, and then were 
given projective test (TAT) cards con- 
taining strong or weak cues for aggres- 
sion. (That is, the scenes depicted on 
these cards were either strongly or 
weakly associated with common aggres- 
sive situations.) Maternal permissive- 
ness toward aggression probably leads 
more to relatively weak internal re- 
straints against aggression than to the 
development of a strong, constantly 
active aggressive “drive.” Thus, it is 
not surprising that the permissively 
reared students in the nonaroused con- 
dition gave no more aggressive re- 
sponses to the TAT than did the less 
permissively trained group in this condition; 
the two nonaroused groups did 
not differ in their instigation to make 
aggressive responses. It was not until 
the women were provoked that significant 
findings emerged, but here too, 
relevant cues were necessary. The 
angered students whose mothers had 
permitted aggression exhibited reliably 
more fantasy aggression than the simi- 
larly reared but nonprovoked group 
only to the strong cue cards. Their 
aroused hostile inclinations were not 
revealed, even though their inhibitions 
were fairly weak, unless aggressive cues 
were present. 

The scenes portrayed on the projec- 
tive test cards presumably served as 
aggression-evoking cues to the extent 
that they depicted situations in which 
aggression ordinarily occurs. The sub- 
ject apparently associated the high cue 
scene with prior situations in which 
she had behaved aggressively. Because 
of this tie the card could evoke hostile 
responses from her fairly easily if she 
was now angry and ready to make 
such responses. In the next study the 
aggression-eliciting stimuli were as- 
sociated with the immediately preced- 
ing arousal condition rather than with 
common aggressive situations. 

In this preliminary experiment 
(based on only seven subjects in each 
condition), male college students, taken 
one at a time, were first given a task 
by one of the two experimenters. This 
experimenter was introduced, depending 
on the condition, either as (a) a 
graduate student in physical education 
and a college boxer, or (b) a graduate 
student in Speech. Then, for half of 
the cases given a particular introduction, 
the experimenter (Experimenter 
1) deliberately insulted the subject during 
the first task, while in the remaining 
cases he treated the subject in a 
neutral, nonangering manner. Following 
this, the second (or neutral) 
experimenter showed the subject a 
7-minute-long film clip, either a fairly 
brutal prize fight scene or one dealing 
with the travels of Marco Polo. 
(We might also note that the subjects 
seeing the prize fight were given a 
brief story outline which, according 
to our earlier findings [Berkowitz & 
Rawlings, 1963], serves to lower internal 
restraints against aggression.) 
At the conclusion of the movie, after 
the subject had completed a questionnaire 
assessing his reactions to the 
film, he was asked to rate each of the 
experimenters on special forms. Since 
these questionnaires supposedly were 
to go to the Dean’s Office as part of a 
survey of student reactions to experiments, 
unfavorable ratings of the graduate 
student-experimenters could conceivably 
hurt them and, therefore, are 
here taken as acts of aggression. We 
will be concerned with the two critical 
questionnaire items asking whether the 
subject would like to have the given 
experimenter as a Discussion Section 
instructor, and whether he would like 
to be in another experiment conducted 
by this person. The scores on the two 
items were summed for each experimenter 
to give us a single index of expressed 
unfriendliness toward the experimenter. 

The experiment was conducted by the 
present writer, Frederick DeBoer, and 
Mark Heironimus. 

Now, what do we expect? The in- 
sulting experimenter (Experimenter 
1), as source of the subject’s anger, 
should of course receive greater ex- 
pressions of hostility than the neutral 
experimenter. But remember also that 
in some cases Experimenter 1 is fairly 
strongly associated with an aggressive 
film; he is seen as a “boxer” by some 
of the men witnessing a movie about 
boxing. If this aggressive movie 
tended to arouse previously learned ag- 
gressiveness habits, as I believe is 
often the case (cf. Berkowitz, 1962, 
Ch. 9), the aroused hostile inclinations 
should generalize most strongly to ex- 
ternal stimuli bearing the strongest 
psychological ties to the film. In other 
words, the insulting experimenter 
should receive the most hostile ratings 
when he is also relatively strongly associated 
with the other arousal source: 
the aggressive prize fight scene, i.e., 
when he has been labeled a boxer 
rather than a speech student. 

TABLE 1 
MEAN CONDITION SCORES ON THE HOSTILITY INDICES, QUESTIONNAIRE RATINGS 
OF EACH OF THE TWO EXPERIMENTERS 

                      Subjects angered by Experimenter 1      Nonangered subjects 
Film:                 Boxing movie       Travel movie         Boxing movie 
Experimenter 1 
identified as:        Boxer    Speech    Boxer    Speech      Boxer    Speech 

Experimenter 1        21.42    15.29     16.00    17.43       11.00    10.71 
Experimenter 2        12.86     9.14     10.10    11.71       10.43    12.14 

Note. The ratings of the two experimenters were subjected 
to one “repeated measures” analysis of variance. Cells having a subscript 
in common are not significantly different at the .05 level [subscripts illegible]. 
The higher the score the greater the unfriendliness 
toward the given experimenter. Each mean is based on seven cases. 

The results summarized in Table 1 
indicate that this expectation was fulfilled. 
The greatest hostility was directed 
toward Experimenter 1 when he 
had insulted the subjects and when he 
had some association with the prize 
fight movie. Since he was associated 
with both arousal sources, the insult 
and the aggressive film, he had a relatively 
strong cue value for aggressive 
responses. He had the stimulus qualities 
which “pulled” fairly intense hostility 
from the subjects who were 
primed to make aggressive responses 
because of the insult they had received. 

Much of the experimental research 
employing notions of the sort described 
here involves associations based on 
physical similarities. In the last study, 
however, Experimenter 1’s association 
with the aggressive film was varied by 
means of verbal labels applied to him; 
he was a “boxer” or he was a “Speech 
student.” One of the major difficulties 
in using associationistic hypotheses to 
deal with socially important psychological 
problems stems from the great 
variety of ways in which associations 
arise. We cannot always specify beforehand 
just what will be the basis 
for a psychological tie between various 
objects or, for that matter, whether a 
person will regard the objects as being 
different in important ways. Our 
thoughts may equate different objects 
for a number of reasons. We may 
regard objectively different people as 
being similar, for example, because we 
have learned to apply the same label to 
them (e.g., “They’re all Democrats”), 
or perhaps because they arouse the 
same emotional feeling in us. 

The hostility displacement involved 
in the scapegoat theory of prejudice 
can be readily understood in terms of 
associations created through the ag- 
gressor’s thoughts (Berkowitz, 1962, 
Ch. 6; Berkowitz & Green, 1962). 
Our research at Wisconsin indicates 
that the hostile responses evoked by one 
frustrater generalize fairly readily to 
other disliked people. Although the 
evidence for a generalization process is 
not as good as we would want in these 
studies, we believe that the thwarted 
person somehow associates his victims 
with the immediately frustrating 
agency. All of these people are dis- 
liked, for example, and all may be re- 
garded as essentially alike in being 
unpleasant. An industrial worker 
thwarted by his employer may displace 
his resentment onto Jews, then, because 
he previously had learned to 
dislike Jews and, disliking them, sub- 
jectively associates Jews with the dis- 
liked and frustrating employer. He 
may also associate Jews with the em- 
ployer because he thinks of all of these 
people as “businessmen,” and associ- 
ation bonds can summate (cf. Berko- 
witz, 1962, p. 160). However the 
association arises, an angered person 
presumably associates the scapegoat 
group with his more immediate frus- 
trater. Because of this tie, the scape- 
goat pulls hostile responses from the 
person who is ready to act aggressively. 


Hostility Catharsis 


The idea of a hostility catharsis is 
surely one of the most widely accepted 
doctrines in the folklore of both the 
man in the street and the social sci- 
entist. For centuries, going back to 
Aristotle, people have believed that 
the exhibition of an emotion can 
“purge” an individual of that emotion. 
Contemporary formulations of this no- 
tion generally deal with anger and ag- 
gressive behavior, and generally main- 
tain that a person’s hostile actions can 
reduce the strength of his aggressive 
tendencies, or somehow make him feel 
better. 

The conception of aggressive be- 
havior outlined so far has some fairly 
definite implications for the analysis 
of hostility catharsis. Before going 
into these implications, however, some 
basic distinctions must be made, in 
essence considerably restricting the 
phenomena included within the scope 
of the catharsis hypothesis. In the 
psychological literature, at least, “catharsis” 
seems to refer most frequently 
to a lessening in the strength of the 
instigation to aggression supposedly 
produced by a hostile act. Such a 
“drive reduction” is not necessarily involved 
in the second meaning of the 
term, a feeling of pleasure or tension 
reduction presumably arising from 
the performance of an aggressive re- 
sponse. Empirical evidence indicates 
the thwarted person may obtain some 
tension release by attacking his frus- 
trater (cf. Berkowitz, 1962, Ch. 8; 
Berkowitz, Green, & Macaulay, 1962), 
especially if he earlier had wanted to 
and had been prepared to aggress 
against that individual but, for some 
reason, had not been able to do so right 
away. This tension reduction can be 
pleasurable, but it is problematic 
whether the overt hostility will neces- 
sarily lessen the likelihood of any fur- 
ther aggression against this frustrater. 
Even if his aggressive act does lessen 
the thwarted person’s immediate anger, 
an aggressiveness habit (or negative 
attitude) may develop; the frustrater 
may acquire the stimulus properties 
which, under the appropriate condi- 
tions, can cause him to evoke aggres- 
sive responses from his victim on some 
later occasion. In a word, the frus- 
trater may function something like 
Martin Luther King to a rabid South- 
ern segregationist. 


Research into Hostility Catharsis 


The present view of aggression as 
evoked behavior suggests that some 
additional details should be incorpo- 
rated into the writer’s previously pub- 
lished analysis of hostility catharsis. 
Summarizing, I have maintained 
(Berkowitz, 1962, Ch. 8; Berkowitz, 
Green, & Macaulay, 1962) that much 
of what is commonly meant by the 
term “catharsis” can be understood as 
a special case of a “completion tendency.” 

The idea of a completion tendency 
certainly is not new in psychology. 
Lewin had discussed such a proposi- 
tion in the 1920’s, and two of his stu- 
dents have published supporting data. 
Thus, we have Zeigarnik's demonstration 
that there is better memory for 
incompleted tasks than for completed 
ones, and Ovsiankina's study indicating 
that people are more inclined to resume 
interrupted activities than to take up 
tasks again that they had recently com- 
pleted (Lewin, 1935, pp. 242-244). 
Failure to complete the tasks pre- 
sumably created the tension leading to 
their better recall and the tendency to 
resume them. 

An activity will tend to continue, of 
course, until the activity goal is reached. 
Preventing the organism from reach- 
ing this goal, i.e., interrupting the ac- 
tivity, can be a source of disturbance. 
Sheffield (1954) has hypothesized that 
there is an “increase in excited emo- 
tion” when a consummatory response— 
completing the activity sequence—is 
stimulated but not allowed to occur 
completely. 

Applying this formulation to aggres- 
sive behavior, we would say that in- 
flicting injury on the intended target is 
the goal or consummatory response 
completing the aggressive response se- 
quence. As long as a person is insti- 
gated to aggress he does not obtain 
“completion” until he sees that he has 
injured his target, or that someone 
else has done so. 

Tension Reduction Following “Com- 
pletion.” Perhaps the best demonstra- 
tions of tension reduction following 
the completion of an aggressive se- 
quence have been reported by Ho- 
kanson. In the first of his experi- 
ments (Hokanson & Shetler, 1961), 
male and female university students 
were led to believe either that the ex- 
perimenter was a high status person— 
in this case, a middle-aged “university 
professor”—or that he was a low 
status person—a fellow student. 
Half of the people in each of these 
two conditions, while working singly, 
were deliberately angered by the 
experimenter, while the remaining 
subjects received a kindlier treatment 
from him. The systolic blood pressure 
obtained at this time showed that physiological 
tension⁶ increased in all of 
the insulted groups independently of 
the instigator’s status. Following the 
arousal, half of the subjects in each of 
the now four conditions were provided 
with a series of sanctioned opportuni- 
ties to give electric shocks to the 
experimenter. Physical aggression 
against the low status instigator ap- 
peared to be tension reducing. The 
systolic blood pressure in the angered, 
aggressing students decreased to the 
level existing in the nonaroused group, 
but the pressure remained high in 
those people not shocking their insult- 
ing peer. The latter presumably be- 
lieved they had not injured the in- 
stigator and, therefore, did not com- 
plete the aggressive sequence. 

The results are somewhat puzzling 
for the high status experimenter con- 
dition. In this case, systolic pressure 
decreased in both the nonaggressing 
and aggressing angered students to the 
level within the nonprovoked groups. 
We might conjecture that the insulted 
nonaggressing students had quickly 
given up the idea of attacking the high 
status professor. They no longer 
wanted or expected to aggress against 
him so that, in essence, an instigated 
aggressive sequence was no longer in 
operation. If such an interpretation is 
correct, these findings lend further sup- 
port to my suggestion that anger 
arousal in itself does not lead to aggressive 
responses even if these responses 
are inhibited. 

6 Many neurophysiologists feel that one 
particular conscious state cannot always be 
attributed to a given physiological pattern. 
Thus, high systolic blood pressure is not 
necessarily indicative of the same conscious 
emotion in all people. The most parsimonious 
interpretation is to regard high systolic pressure 
as indicative of physiological tension. 

A second experiment (Hokanson & 
Burgess, 1962) showed that the find- 
ings obtained in the low status experi- 
menter condition are reliable and, in 
addition, highlight the importance of 
perceived injury to the instigator. 
Only a low status experimenter was 
used this time but, as before, half of 
the subjects, male and female college 
students, were deliberately angered and 
half were not. Also as in the earlier 
experiment, one group of subjects could 
shock the experimenter in a permissible 
manner after the provocation, while 
the others were given a more neutral 
task. Two other treatments were also 
carried out. One group created a 
story in response to a TAT card, and 
a final condition gave the experimenter 
questionnaire ratings (i.e., this last 
condition had an opportunity to ad- 
minister verbal aggression) after the 
arousal. 

Systolic blood pressure and heart 
rate measures again indicated that there 
was a significant physiological arousal 
following the provocation. The frus- 
trated subjects given an opportunity 
to injure their tormenter, either phys- 
ically (by shocks) or verbally (through 
the ratings), then showed a lessening 
in physiological tension so that their 
final scores were not different from 
those in the nonfrustrated groups. The 
physiological tension remained high, 
however, in the insulted TAT and neu- 
tral response conditions. Assuming he 
wants to and is prepared to attack his 
frustrater, the angered individual must 
believe he has injured that person in 
retaliation for the injury he has re- 
ceived if there is to be a physiological 
relaxation. Fantasy aggression in re- 
sponse to the TAT card in this study 
did not provide “closure.” Fantasy 
aggression in general, I believe, is 
relatively unlikely to be satisfying, i.e., 
produce completion. 

Before proceeding further, let us 
again note the important proviso: there 
is no completion (in the sense used in 
this paper) unless there has been an 
ongoing aggressive sequence. What 
are the conditions determining whether 
such an “activated” response chain ex- 
ists? Consider the case of Person A 
who has been angered by Person B. 
The anger arousal in itself, I con- 
tended, does not necessarily give rise 
to an ongoing aggressive sequence. 
Appropriate cues, stimuli associated 
with the anger instigator, presumably 
must also be present if the aggressive 
responses are to be made. Once such 
responses occur (perhaps only im- 
plicitly in his thoughts), Person A 
theoretically will not obtain completion 
until he believes that B has been ag- 
gressively injured. 

However, if the aggressive sequence 
is set into operation, but completion is 
prevented, internal tension is induced 
which is channeled into whatever re- 
sponse happens to be under way at the 
time. (This last proposition is consist- 
ent with views offered by many psy- 
chologists—e.g., Amsel, 1958; Brown 
& Farber, 1951; Mowrer, 1960; Sears, 
Whiting, Nowlis, & Sears, 1953.) Ex- 
tending the proposition to aggressive 
behavior, we have the frustration-ag- 
gression notion that blocking aggressive 
reactions increases the predisposition 
to further aggression. 

An experiment by Thibaut and 
Coules (1952) provides data support- 
ing this latter prediction. In the first 
part of their paper they reported that 
male college students who were provoked 
by an insulting message from a 
peer and then were given an oppor- 
tunity to communicate back to him had 
less residual hostility toward the in- 
stigator at the end of the session than 
did a similarly angered group not provided 
with this communication opportunity. 
However, the investigators 
were well aware that they had not 
really demonstrated the existence of a 
hostility catharsis. The condition dif- 
ferences could be due to a frustration- 
induced rise in hostility in the No 
Communication condition. In order to 
test this possibility they performed a 
second experiment in which every 
aroused subject was permitted to reply 
to the instigator, but the men in one 
group were interrupted for three min- 
utes before they could avail themselves 
of this opportunity. Analysis of the 
messages sent to the frustrater con- 
firmed the frustration-aggression hy- 
pothesis. The people who had had to 
wait before replying sent a reliably 
greater volume of aggression to the 
instigator than did the students responding 
more rapidly to the provocation 
even though the instigator was not 
responsible for the interruption. 

Heightened Predisposition to Aggression 
Following Inhibited Aggression. 
This last finding, of course, is 
another example of hostility displace- 
ment. The aggressive inclination en- 
gendered by the experimenter’s inter- 
ruption apparently had been displaced 
onto the other subject. (Or to put 
this in other terms, the disliked stimu- 
lus person had evoked more intense 
hostility because of the stronger arousal 
state within the interrupted person.) 
However, while such a finding is con- 
sistent with the concept of displacement, 
we usually think of displacement in 
terms of anxiety-inhibited aggression. 
That is, the angered person is unwill- 
ing to attack his frustrater because of 
fear or anxiety and then takes out his 
resentment on a safe, available target. 
But are these two examples really 
different if we consider only the blocked 
aggressive sequence? The anxiety-in- 
duced inhibitions are also frustrating 
and, like the experimenter's interruption 
in the Thibaut-Coules study, 
should also lead to a heightened arousal 
state. If an angered person cannot at- 
tack his frustrater right away as he 
wants and is set to do because of his 
inhibitions, he is thwarted still more, 
and, as a consequence, there is a 
stronger predisposition to aggress 
against other available targets. Where 
the less inhibited individual directs his 
aggression primarily against the person 
whom he believes had provoked him 
and exhibits relatively little displaced 
hostility toward other objects, the re- 
strained individual shows comparatively 
little overt aggression toward the frus- 
trating agent he cannot or dares not 
attack—but relatively strong open hos- 
tility toward other objects. The ag- 
gression against these latter substitute 
targets has been heightened by the 
thwarting of the direct attack upon 
the anger source. 

Results obtained in one of our Wis- 
consin studies of movie violence dem- 
onstrate such a displacement process 
at work (Berkowitz, Corwin, & 
Heironimus, 1963). Male college 
students, again taken one at a time, 
were deliberately angered by a gradu- 
ate student experimenter in one con- 
dition, while the remaining students 
were given a neutral treatment. Fol- 
lowing this brief interaction, two- 
thirds of the subjects in each of these 
conditions watched the filmed prize 
fight scene I had mentioned earlier, 
and the other subjects saw an equally 
long movie about English canal boats. 
The prize fight subjects were given 
one of two kinds of introductions to 
the film. Half of them were given a 
story outline more or less justifying 
aggression, while the synopsis provided 
the other subjects in this “aggression 
movie” condition did not 
sanction aggression. We have reason 
to believe that the former movie introduction 
produced weaker inhibitions 
against aggression than the latter introduction 
(cf. Berkowitz & Rawlings, 
1963). Finally, all of the subjects 
completed a questionnaire in which 
they rated the experimenter and the 
experiment. For our present purposes 
we can confine ourselves to the 
questionnaire results for the angered 
men witnessing the prize fight movie. 
What can we expect if inhibitions 
against aggression did vary with the 
movie introduction? The angered 
subjects having the relatively weak restraints 
should exhibit comparatively 
strong hostility toward the source of 
their anger (the experimenter) on the 
questionnaire and a lower level of 
resentment toward other things, 
such as the experiment, associated 
with him. The anger source, we might 
say, receives more intense aggression 
than other associated objects. The 
more inhibited-angered group, on the 
other hand, would be somewhat less 
likely to attack the anger source, the 
insulting experimenter, and comparatively 
more likely to attack the associated 
object, the experiment. This 
was indeed the case. In the less inhibited-angered 
group 13 of the 15 
subjects displayed stronger hostility 
toward the experimenter than toward 
the experiment. However, in the 
angered group presumably having the 
stronger inhibitions, only 6 of the 15 
showed stronger hostility toward the 
source than toward the associated experiment 
(χ² = 7.02, p = .01). Thus 
the latter were somewhat more likely 
to displace their resentment onto the 
experiment rather than the insulting 
experimenter.⁷ 


7 In comparison to the mean scores existing 
in the insulted, less inhibited group, 
the angered inhibited subjects displayed less 
open hostility toward the frustrating experimenter 
but more hostility toward the 
experiment. 
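
The comparison of 13 of 15 with 6 of 15 subjects can be checked with a short calculation. The sketch below assumes, without confirmation from the source, that the reported statistic is a Pearson chi-square on the 2 x 2 table without a continuity correction, and that every subject who did not show stronger hostility toward the experimenter falls in the second column; the function name `pearson_chi_square_2x2` is introduced here for illustration.

```python
def pearson_chi_square_2x2(a: int, b: int, c: int, d: int) -> float:
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    return n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)


# Rows: less inhibited and more inhibited angered groups (15 subjects each).
# Columns: more hostile toward the experimenter than toward the experiment,
# and not more hostile toward the experimenter (assumed complement).
chi2 = pearson_chi_square_2x2(13, 2, 6, 9)
print(round(chi2, 2))  # 7.03
```

Under these assumptions the value is about 7.03 with one degree of freedom, which is close to the reported 7.02; the small difference may reflect rounding or a slightly different computation in the original analysis.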
Expectations to Aggress. Use of 
the frustration-aggression formula re- 
quires a clear specification of the 
nature of frustration. Preventing a 
person from satisfying a need we think 
he possesses does not necessarily mean 
the person is actually frustrated. He 
is frustrated, I have argued in other 
pages (Berkowitz, 1962, Ch. 2), when 
an ongoing response sequence is 
blocked. A writer thoroughly en- 
grossed in his work is not frustrated 
just because he has been without food 
for a number of hours; there are no 
ongoing eating response sequences in 
either the writer’s thoughts or his 
overt activity that are prevented from 
reaching completion. He is just not 
concerned with getting food. Simi- 
larly, an angered person is not frus- 
trated merely because he has no op- 
portunity to attack the thwarting 
agency; being angry does not neces- 
sarily mean there is an activated or 
ongoing aggressive response sequence. 

This important qualification must 
be kept in mind in evaluating the re- 
sults of the Rosenbaum and deCharms 
(1960) modification of the Thibaut- 
Coules study. Contrary to the 
earlier investigators, Rosenbaum and 
deCharms found that subjects high in 
self-esteem who were not permitted 
to aggress against their frustrater were 
not more hostile at the conclusion of 
the session than were the other high 
self-esteem men given an opportunity 
for aggression.⁸ 


8 The low self-esteem subjects appeared to 
show a “catharsis” effect. However, as I 
have suggested elsewhere (Berkowitz, 
1962), the men who were low in self-esteem 
in this study may well have experienced 
strong aggression anxiety and, consequently, 
inhibited signs of open aggression when 
they were given an opportunity to retaliate 
because they were not sure how they would 
be accepted by the other people in the 
experiment. In other words, they were 
tempted to aggress, felt anxious because the 
aggressive behavior might bring disapproval, 
and then leaned over backwards to 
avoid showing such “dangerous” (for them) 
behavior. Rosenbaum and deCharms report 
these subjects exhibited very little open 
aggression when they had the opportunity 
to retaliate against their frustrater and, as 
they acknowledge, the “catharsis” found 
for this group was probably more apparent 
than real. 

Why the difference between these 
two experiments? Obviously it is not 
enough just to say an instigated aggressive 
sequence was prevented from 
reaching completion in the Thibaut-Coules 
men while there was no such 
ongoing sequence in the Rosenbaum-deCharms 
subjects. We also have to 
indicate why this difference might 
have arisen. Examination of the procedures 
carried out in the two studies 
suggests a possible explanation for 
the negative results in the latter experiment 
which is consistent with the approach 
taken throughout the present 
paper. Unlike the aroused men in the 
Thibaut-Coules study, the provoked 
people in the later Rosenbaum-deCharms 
experiment had not expected 
to be able to retaliate against their tormenter. 
Not anticipating an opportunity 
to aggress, there conceivably 
were no ongoing aggressive responses 
in the latter men. More concretely, 
since they knew they would not be able 
to reply to the person who had insulted 
them, they probably were not 
thinking of how they were going to tell 
their tormenter off and, therefore, they 
conceivably were not disappointed 
when they could not do so. 

A recent experiment from our Wisconsin 
laboratory was designed to test 
this analysis.⁹ The subjects, male college 
students, were assembled in pairs, 
ostensibly for an investigation of problem 
solving under stress. Each subject 
worked alone on an assigned problem 
believing his partner (who was sitting 
in another room) would give him electric 
shocks if the partner thought the 
problem solution was inadequate. In 
four of the five conditions after the experimenter 
had collected the written 
problem solutions and had left the room 
supposedly to deliver them to the 
partner, he administered eight shocks 
to the subject. Since the subject believed 
they came from his peer in the 
adjoining room, this excessive number 
of shocks generally caused the subject 
to become angry with his partner. 

9 The experiment was conducted by the 
writer and Ronald Luehrig. 

In two of the conditions the subjects 
had been informed that they would 
have an opportunity to judge the part- 
ner’s work by means of the shocks after 
he had evaluated their performance. 
These people then presumably expected 
that they would have a socially sanc- 
tioned opportunity to retaliate for the 
harsh treatment they had received (Ex- 
pect to Aggress group). The remain- 
ing men had been led to believe they 
would have no opportunity to ad- 
minister shocks to their partner (Ex- 
pect Not to Aggress group). About 
5 minutes after they had received 
their anger arousing shocks all sub- 
jects were shown what was supposedly 
the partner’s solution to another prob- 
lem. The expectations were met in 
only half of the cases. Only half of 
the people in the Expect to Aggress 
condition were permitted to administer 
shocks at this time (the partner did not 
receive any shocks, of course), and 
only half of those in the Expect Not 
to Aggress condition were not able to 
shock their partner. The remaining 
people were told that the experimenter 
had made a mistake in reading the 
schedule. Thus, the other half of the 
men in the Expect to Aggress group 
found that they would not be able to 
give the shocks they had anticipated 
giving, and the other half of the sub- 
jects in the Expect Not to Aggress 
treatment learned that they actually did
have an opportunity to evaluate their 
partner via the electric shocks. 

When all the subjects had looked at 
their partner’s solution and had ad- 
ministered as many shocks to him as 
they thought desirable—that is, if these 
shocks had been permitted—they were 
given a brief questionnaire on which 
they rated, among other things, how 
much inner tension they felt at the 
time. Shortly after this questionnaire 
was completed, the subjects were again 
shown what was supposedly the part- 
ner’s performance on yet another prob- 
lem and this time all subjects had an 
opportunity to “rate” the performance 
by giving the partner shocks. A sec- 
ond questionnaire was then admin- 
istered assessing the subjects’ felt ten- 
sion level on this occasion. The 
subjects also rated their partner on a 
brief adjective check list used as a 
hostility measure in other research in 
our project. The experiment was then 
concluded and the deceptions were 
explained. 

In addition to the four conditions 
created by means of the experimental 
manipulations described up to this 
point, a nonangered control group was 
also established. These people had re- 
ceived the minimum one shock as the 
evaluation of their work, had expected 
to be able to shock their partner in re- 
turn, and actually were permitted to 
administer the shocks to him. They 
also completed the various question- 
naires at the appropriate times. 

Our most important question con- 
cerns the effects of the subjects’ initial 
expectations. Would those angered
men who had anticipated being able
to shock the anger instigator and who
then were deprived of this aggressive
opportunity display signs of stronger
frustration than the other subjects
before the end of the session? The
reasoning employed in the present
paper suggests that this should be the
case.¹⁰ In the words of the more
behavioristically inclined psychologists,
when we expect to obtain a certain
satisfaction we make “fractional
anticipatory goal responses.” This
phrase is a good one for our purposes. 
First of all, it suggests that the goal 
is symbolically present in our thoughts. 
Expecting to aggress means the ag- 
gression goal is symbolically repre- 
sented. This representation serves as 
a cue evoking aggressive responses. 
We can say the thought of the goal, 
the anger instigator, is the cue eliciting 
implicit aggressive responses—if we are 
angry and primed. Thus, the phrase 
also implies there is an ongoing ag- 
gressive response sequence if only in 
the thoughts of the angered men in 
the Expect to Aggress condition. 
Those angered people not anticipating 
an opportunity to aggress against 
their tormenter should be less likely 
to have such an aggressive sequence 
in operation and, consequently, should 
be less bothered by not being per- 
mitted to attack him. 

In the present experiment, then, the 
former group deprived of an expected 
opportunity should report a higher 
level of experienced tension before 
they are given the final chance to 
shock their partner. Further, when 
this opportunity is provided, they 
should exhibit stronger overt hostility;
the frustrated aggressive sequence 
should heighten the arousal state lead- 
ing to more intense aggressive acts at 
the end of the session. 

The results with our shock measures
are summarized in Table 2. The first
line of data indicates that the people
in each angered condition reported
receiving essentially the same number
of shocks from their partner. That
the excessive punishment was anger
provoking is suggested by the condition
differences in shocks given for the
partner’s first performance (line 2).
The two presumably angered groups
who were permitted to shock their
partner at this time gave reliably more
shocks to him (when combined) than
the nonangered control group.
Opportunity to attack the anger
instigator in the adjoining room produced
cues evoking aggressive responses
from the angered men. Note, however,
that there was no difference between
the two provoked groups in
number of shocks administered at this
time; the Expect to Shock-Able to
Shock men displayed the same level
of overt aggression in their first
“judgments” as the subjects who had
not anticipated being able to retaliate.
The information given the former
people that they could shock their
partner right away had apparently not
“stirred them up” in the short time
before aggression was permitted.
The “expect to aggress” information
evidently did start an implicit aggressive
sequence in operation in the
angered subjects, however. As the
last line shows, when the anticipated
aggressive activity was blocked in the
Expect to Shock-Not Able condition,
a heightened arousal resulted so that
the men in this group administered
more severe punishment to the anger
instigator at the end of the session than
did the people in any other group.
The thwarting of the instigated
aggressive sequence apparently had to
persist for some minutes before the
increased inclination to aggression
could be revealed in stronger overt
hostility.

10 Some might argue that any heightened
predisposition to aggression stems from the
thwarting of a “set” or (if they prefer other
terminology) “expectancy.” But does the
use of such concepts result in a better
explanation? I do not think so. What is a
set? It may consist of an implicit but
nevertheless ongoing chain of responses.


TABLE 2

MEAN NUMBER OF SHOCKS REPORTED RECEIVED AND NUMBER GIVEN

        | Angered         |          |                     |          | Nonangered
        | Expect to shock |          | Expect not to shock |          | Expect to shock
Measure | Able            | Not able | Able                | Not able | Able

1. Number shocks reported received by subject on first problem:
2. Number shocks given by those permitted to “rate” partner’s first problem after arousal:
3. Number shocks given as judgment of partner at end of session: | 3.72 | 4.42 | 3.88 | 3.64 | 3.08

Note.—Except for Measure No. 1, the shock data have been transformed using the √x + √(x + 1)
transformation in order to facilitate statistical analysis. Each mean is based on 16 cases.
The nonangered group gave fewer shocks to the partner on the first problem than the combined
angered groups (F = 17.40, p < .01). Differences among conditions were tested by t test
employing the residual mean square from the preliminary analysis of variance in the error
term. Cells having a subscript in common are not significantly different at the .05 level.


TABLE 3

MEAN RATING OF THE PARTNER ON THE ADJECTIVE CHECK LIST

Angered         |          |                     |          | Nonangered
Expect to shock |          | Expect not to shock |          | Expect to shock
Able            | Not able | Able                | Not able | Able
19.25ab         | 22.44    | 17.44               | 20.38    | 15.06

Note.—The higher the score the greater the indicated unfriendliness. Each mean is based on
16 cases. Differences among conditions were tested by t test employing the residual mean
square from the preliminary analysis of variance in the error term. Cells having a subscript
in common are not significantly different at the .05 level of confidence.


TABLE 4

MEAN SCORES ON TENSION INDEX

        | Angered         |          |                     |          | Nonangered
        | Expect to shock |          | Expect not to shock |          | Expect to shock
Measure | Able            | Not able | Able                | Not able | Able

1. After seeing the partner’s first performance: | 20.48 | 21.83 | 19.27 | 20.38 | 20.24
2. After giving partner the final shocks at end of session: | 20.44 | 20.89 | 18.82 | 21.13 | 20.27
3. Mean change: | −0.04 | −0.94 | −0.45 | +0.75 | +0.03
4. Number in group decreasing in reported tension: 11   6   6   7

Note.—The tension data were transformed using the √x + √(x + 1) transformation. The two sets
of tension scores were subjected to one “repeated measures” analysis of variance with the
t tests between condition means employing the residual mean square from this analysis in
their error term. Cells having a subscript in common are not significantly different at the
.05 level. Each mean is based on 16 cases. The higher the score the higher the reported
tension.
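
The notes to Tables 2 and 4 refer to a √x + √(x + 1) transformation of the raw scores. A minimal sketch of that arithmetic follows, assuming it is the square-root-sum transformation commonly credited to Freeman and Tukey for stabilizing the variance of counts; the function name and the sample counts are hypothetical and are not data from the experiment.

```python
import math


def sqrt_sum_transform(x: float) -> float:
    """Return sqrt(x) + sqrt(x + 1) for a nonnegative count x."""
    if x < 0:
        raise ValueError("counts must be nonnegative")
    return math.sqrt(x) + math.sqrt(x + 1)


# Hypothetical shock counts for one group (illustration only).
counts = [1, 3, 4, 8]
transformed = [sqrt_sum_transform(c) for c in counts]

# Group means in the tables would be means of transformed scores,
# which is why they do not read as whole numbers of shocks.
mean_transformed = sum(transformed) / len(transformed)
```

On this reading a tabled mean of roughly 3.7 corresponds to about three shocks, since √3 + √4 is approximately 3.73.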

But while there are indications of a 
frustration-strengthened arousal state, 
Table 2 contains no evidence of a 
cathartic reduction in aggressive ten- 
dencies following the initial display of 
hostility. The subjects in the Expect 
to Shock-Able group gave just as 
many shocks to their partner at the 
end of the session as the people who 
had neither expected nor had been 
able to shock their peer earlier. 

As can be seen in Table 3, essentially 
the same pattern of results was obtained
with the adjective check list
measure of hostility administered af- 
ter the subjects had given their part- 
ner the final shocks. In this case the 
Expect to Shock-Not Able group was 
more unfriendly to their peer than 
any of the other provoked conditions, 
although this difference did not attain 
statistical significance. These men, 
however, were the only angered men 
expressing reliably greater unfriendli- 
ness to the partner than the control 
group. Also, there is no evidence of 
a cathartic hostility reduction with 
this measure either. 

But while the anger reduction ver- 
sion of the catharsis hypothesis re- 
ceives little support, there is some evi- 
dence for the contention that aggres- 
sive behavior may lead to a lessening 
of experienced tension under some con- 
ditions. The subjects in the present 
experiment rated their mood on three 
separate scales on two separate oc- 
casions. The occasions were: (a) 
after they had been provoked by the 
partner and had been given or not
given an opportunity to retaliate, and 
(b) after all subjects had administered 
shocks to the partner at the end of the 
session. The mood items were three 
bipolar scales, relaxed—tense, happy— 
sad, and angry—not angry, on which 
the subject had to indicate his feelings 
at the time. A combined tension in- 
dex was established by adding the 
three scores, taking care to have a high 
score signify “high felt tension” in 
each case. Now, how did the condi- 
tions differ in these tension scores on 
the two occasions? The results are 
summarized in Table 4. 
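
The construction of the combined index can be sketched as follows. This is only an illustration under stated assumptions: the text does not give the scale range, so a 1 to 7 range is assumed, with the first-named pole of each bipolar scale scored 1; it is also assumed that the "sad" pole counts toward tension. The function names are hypothetical.

```python
SCALE_MIN, SCALE_MAX = 1, 7  # assumed range; not stated in the text


def reverse(score: int) -> int:
    """Flip a rating so that the opposite pole receives the high score."""
    return SCALE_MIN + SCALE_MAX - score


def tension_index(relaxed_tense: int, happy_sad: int, angry_not_angry: int) -> int:
    """Sum three mood ratings so that a high total means high felt tension.

    relaxed_tense and happy_sad already run toward the "tense" and "sad"
    poles, so they are added as given. angry_not_angry runs toward
    "not angry", so it is reversed before being added.
    """
    return relaxed_tense + happy_sad + reverse(angry_not_angry)


most_tense = tension_index(7, 7, 1)   # tense, sad, angry
least_tense = tension_index(1, 1, 7)  # relaxed, happy, not angry
```

The reversal step is the "care" the text mentions: without it, a subject who reported being very angry would lower rather than raise his total.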

As we would expect, the people in 
the Expect to Shock-Not Able condi- 
tion on Occasion 1 reported feeling 
more tense than the men in any of the 
other groups. While they did not 
differ reliably from the other angered 
conditions, these people were the only 
ones to have a mean tension score 
significantly higher than that in the 
nonangered control group. The 
thwarted ongoing aggressive sequence 
in the subjects who had anticipated 
being able to shock their peer but 
then were unable to do so apparently 
produced a relatively high level of felt 
tension within them. 

On Occasion 2, after all subjects 
had been given their final socially 
sanctioned opportunity to aggress 
against their partner, the mean tension
level in the Expect to Shock-Not Able
condition decreased so that it no
longer differed from that in the con- 
trol group. There was not a sig- 
nificantly greater decrease in this 
group than that in any other condi- 
tion, but the general tendency is cer- 
tainly as we would have predicted: 
The people who had expected to attack 
him earlier but could not do so, and 
thus, whose implicit aggressive re- 
sponses were prevented from reaching 
completion, tended to experience a
fairly strong tension reduction when 
they finally were permitted to aggress 
against their tormenter. 

That the tension decrease in this 
condition is more than a regression 
toward the mean is suggested by the 
data given in the last line of the table. 
Since several of the subjects in each 
of the groups, particularly those in 
the Expect to Shock-Able condition, 
showed unusual tension increases, com- 
parisons were also made of the number 
of men in each group exhibiting a 
reduction in felt tension. The results 
of this analysis again indicate that 
the “expect to aggress” information
had instigated an implicit aggressive 
sequence in the angered subjects 
which attained at least some degree of 
completion with the final shocks. Com- 
bining the two Expect to Shock 
groups and the two Expect Not to 
Shock conditions, we find a reliably 
greater proportion of the subjects in 
the former “treatment” decreased in 
reported tension from Occasion 1 to 2
(χ² = 4.00, p = .05).
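
The form of this comparison can be sketched as a 2 × 2 chi-square on the number of men who did and did not decrease in tension. Each combined group contains 32 men (two cells of 16). The counts used below, 20 of 32 and 12 of 32, are assumptions chosen only because they reproduce a chi-square of 4.00; they are not read from Table 4. No continuity correction is applied, which is also an assumption.

```python
import math


def chi_square_2x2(a: int, b: int, c: int, d: int) -> float:
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    numerator = n * (a * d - b * c) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


# Rows: Expect to Shock, Expect Not to Shock (32 men each).
# Columns: decreased in tension, did not decrease.
# Counts are illustrative assumptions, not values from the article.
chi2 = chi_square_2x2(20, 12, 12, 20)

# With 1 degree of freedom the upper-tail probability of chi-square
# equals erfc(sqrt(chi2 / 2)).
p_value = math.erfc(math.sqrt(chi2 / 2))
```

A value of 4.00 lies just above the conventional 1-df criterion of 3.84, which agrees with the borderline probability reported in the text.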


Completion through Symbolic Injury.
We have already noted that
an aggressive response sequence may
attain completion through symbolic
injury of the anger instigator.
Hokanson and Burgess (1962), it will be
recalled, had observed a reduction in
physiological tension in previously
angered subjects permitted to rate their
tormenter on a questionnaire.¹¹
Additional indications of the value of
symbolic injury are found in research
from our Wisconsin laboratory.
In an experiment by Berkowitz, Green,
and Macaulay (1962), when the subjects
were told their peer had done a
bad job on the assigned task, the
angered group not permitted to aggress
against him indicated they felt
reliably better than did the people
in any other condition. The provoked
people may have been pleased to learn
the instigator had done so poorly. In
other words, they may have believed
that the experimenter had injured
their frustrater symbolically by
“showing him up.” Some support is
given this interpretation by the findings
in a more recent experiment by the
same researchers. Under similar
conditions, angered subjects shown that
their instigator had done a very poor
job had a reliable increase in the
“goodness” of their mood. Such a
change did not occur in any of the
nonangered groups or in the aroused
people who were led to believe the
instigator had done good work.

11 In a later experiment, published as this
paper went to press, Hokanson, Burgess, and
Cohen (1963) found that frustrated college
students given an opportunity to administer
electric shocks exhibited a reduction in blood
pressure level (in comparison to that displayed
by frustrated people not permitted
to aggress) when the object of their attack
was the frustrater. There were lesser degrees
of reduction in physiological tension,
and not significantly different from that
shown by the frustrated control group, when
the provoked subjects could shock only
substitute objects such as the frustrater’s
assistant. Completion apparently is more
satisfactorily achieved through attacks on the
frustrater than through aggression against
substitutes.


SUMMARY


What can we conclude from these
findings? Should we encourage
people to express whatever hostile
wishes they might have? An angered
person does not necessarily make
aggressive responses. Among other
things, stimuli associated with the
anger instigator must be present,
whether in the objective situation or
in his thoughts, if the hostile activity
is actually to occur. By encouraging
the angry person to attack his
frustrater we may actually provide him
with these aggression-evoking cues
(as well as lower his inhibitions by 
justifying such hostile acts). How- 
ever, if he is thinking about aggres- 
sing against his tormenter and wants 
to do so—in essence, symbolically giving
himself hostility eliciting cues—an
inability to attack him is a frustration 
and may heighten his already aroused 
aggressive predispositions. Should he 
then be able to aggress against his 
anger instigator he may feel better as 
his internal, frustration-engendered 
tension is reduced. “Feeling better,” 
however, does not in itself mean there 
is a lessened likelihood of attacking the 
anger instigator again on some future 
occasion. The individual may have 
acquired an unfavorable attitude to- 
ward the anger instigator so that on 
some future occasion he may be quick 
to react aggressively to him when he 
is frustrated either by this person or 
someone else (cf. Berkowitz & Green, 
1962). 
Issues relating to the psychological interpretation of factors are
surveyed, and possible solutions are suggested. The most basic 
arguments regarding the reality and interpretability of factors are 
examined in terms of preferences for different modes of verbal 
formulation, preferences for different types of theoretical constructs, 
and confusion regarding the function of constructs. The multiplicity- 
parsimony issue largely dissolves when examined in the light of the 
functions of scientific theory, but the relevant concepts of simplicity 
and unity need more operational analysis. Arguments regarding 
causal status stem largely from widespread failure to analyze causal 
concepts and focus directly on the component issues. The closely
related problems of descriptive generality are examined in the light
of a modified hierarchical model.


During the past few decades, there 
has been increasing use of the various 
methods of factor analysis in many 
areas of psychology. Throughout this 
same period, controversies regarding 
the proper psychological interpretation 
of the products of these methods have 
persisted in virtually unmodified form. 
It is the contention of the present 
writer that many of the points of
dispute have remained unresolved 
simply because of a failure of writers 
in most quarters to subject the under- 
lying issues to sufficient critical analy- 
sis. It is the purpose of the present 
paper to examine these basic issues 
and offer some suggestions for pos- 
sible solutions. Since arguments have 
arisen chiefly in the context of the 
traditional R-technique analysis of in- 
tellectual performance and personality, 
it is in this area that we shall focus 
primary attention. The basic issues 
that we shall consider, however, are 
broader than this realm of application, 
and the conclusions we shall draw 
should be considered subject to logi- 
cal extension to other applications of 
factorial techniques. 


THE THEORETICAL STATUS
OF FACTORS


Perhaps the question of greatest 
priority is whether factors possess any 
psychological meaning at all. Allport 
(1937) once noted that many factors 
fail to make good psychological sense 
and that they thus run the risk of 
being considered mathematical arti- 
facts. Others have argued that we 
should not attempt to make psycho- 
logical sense of factors as such. 
Anastasi (1938), for example, regards 
the factor as a useful mathematical ex- 
pression of correlational trends. As 
such, it is no more an “artifact” than 
the standard deviation, but it should 
not be construed as representing a 
psychological entity. Essentially the 
same position has been expressed in 
various publications by Thomson 
(1935, 1951). Other writers, like 
Guilford (1940, 1959), have consistently
argued that factors may rep-
resent a kind of psychological reality 
and that, if properly derived from 
properly designed research, they are 
not simply “statistical coefficients.” 
Bartlett (1937), who allies himself 
fairly closely with Thomson, has even
argued that Thomson’s own theory 
provides a basis for attributing some 
degree of reality to factors. 

These apparently divergent positions 
are not so contradictory as they might 
appear at first glance. It must be 
granted that, by whatever procedure 
it is derived, a factor remains a 
mathematical function of the correla- 
tions with which one begins. If these 
correlations express covariation among 
psychological variables, the factor will, 
in a somewhat different way, repre- 
sent such covariation. Depending on 
the circumstances of observation, cor- 
relations may reflect peculiarities of a 
particular occasion of testing or of a 
particular sampling of subjects. A 
resulting factor is subject to the same 
influences, as well as those dependent 
on the overall composition of the test 
battery. The use we can make— 
practical or theoretical—of a given cor- 
relation depends on the circumstances 
under which it is replicable. The 
same thing is necessarily true for a 
factor. Whether the correlations are 
governed essentially by relatively 
transitory or by stable influences, they 
represent a kind of psychological 
“reality” which is inevitably mirrored 
in the factor. To the extent that a 
factor fails to be replicable, of course, 
it may be pointless to interpret it, 
since the underlying reality consists 
in large part of “chance”—i.e., of 
influences which we have failed to 
control or which are irrelevant to our 
immediate theoretical interests. 
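
The point that a factor is a mathematical function of the correlations can be illustrated with a small computation. The sketch below is an assumption-laden stand-in, not any particular analyst's procedure: it extracts the first principal component of a correlation matrix by power iteration and scales it to loadings, whereas the methods discussed in this literature (centroid, principal-axis with communality estimates, and so on) differ in detail. The correlation matrix is hypothetical, and the iteration assumes positively correlated variables.

```python
import math


def first_principal_loadings(R, iterations=200):
    """Return (loadings, eigenvalue) of the dominant component of matrix R.

    Uses plain power iteration: repeatedly multiply a vector by R and
    renormalize. Nothing enters the computation except the correlations.
    """
    n = len(R)
    v = [1.0 / math.sqrt(n)] * n
    for _ in range(iterations):
        w = [sum(R[i][j] * v[j] for j in range(n)) for i in range(n)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    # Rayleigh quotient gives the eigenvalue for the unit-length vector v.
    eigenvalue = sum(
        v[i] * sum(R[i][j] * v[j] for j in range(n)) for i in range(n)
    )
    loadings = [math.sqrt(eigenvalue) * x for x in v]
    return loadings, eigenvalue


# Hypothetical correlations among three tests, all equal to .50.
R = [
    [1.0, 0.5, 0.5],
    [0.5, 1.0, 0.5],
    [0.5, 0.5, 1.0],
]
loadings, eigenvalue = first_principal_loadings(R)
```

Changing any entry of the hypothetical matrix changes the loadings, which is the sense in which a factor inherits whatever occasion-specific or sample-specific influences shaped the correlations.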

Since everyone who examines the 
issue must grant that factors possess 
psychological reality in the sense just 
considered, the truly important dif- 
ferences in viewpoint are those re- 
lating to the extent to which, and to 
the way in which, we may justifiably 
draw theoretical interpretations or in- 
ferences from the observed factor. 
Thomson’s (1951) position is a highly
conservative one, for he sees the value 
of factor analysis as being almost en- 
tirely practical, rather than theoretical. 
Most other prominent factor analysts, 
however, are interested in assigning 
psychological interpretations to factors 
for theoretical purposes. In examin- 
ing the various viewpoints regarding 
psychological interpretation, we should 
bear in mind that writers in this field 
generally require the term factor to 
perform a double service. Some con- 
fusion arises from the fact that the 
word may refer either to the statistical 
entity yielded by our mathematical op- 
erations or to a psychological interpre- 
tation of this. There might be some 
value in using different terms for the 
statistical concept and the psychologi- 
cal (or more generally, theoretical) 
concept. Cattell (1957, 1961) has 
apparently recognized this need in em- 
ploying source trait to represent his 
psychological interpretation. In a re- 
cent paper (Cattell, 1961) he has 
underscored the fact that the factor 
as a statistical entity relates in a vari- 
able and imprecise way to the under- 
lying source trait. He has not seen 
fit to remedy the terminological am- 
biguity, however, for he continues to 
employ the word factor in a double 
sense. It is doubtful that any termino- 
logical substitution proposed now 
would gain universal acceptance. In 
some discussions, however, the intro- 
duction of subscripts (e.g., using fac- 
tor, and factor, to represent the sta- 
tistical and psychological concepts, re- 
spectively) might make for clearer 
communication. 

Some of the confusion stemming 
from the terminological ambiguity is 
apparent in the recurrent discussions 
of such questions as whether factors 
are genotypic or phenotypic. As a
statistical entity, the factor is strictly a
function of covariation currently present
in the observed variables and must
be regarded as representing a pheno- 
typic correlational trend. The pheno- 
typic condition presumably has a his- 
tory and corresponds in some way to 
genotypic determiners. The essential 
question is whether from the statisti- 
cal entity we can make a fairly direct 
inference to a more or less unitary 
genotypic influence. If so, the factor 
may be interpreted in terms of this 
influence and considered, as a psycho- 
logical concept, to be genotypic. 
Nothing in the nature of the factor 
procedures could guarantee this conse- 
quence. Apart from a logical prob- 
lem regarding the nature of “unity” 
(which we shall consider below), the 
question is an empirical one, whose 
answer will probably vary depending 
on the character of the variables we 
choose to measure. In any case, a 
“factor” in one sense must be pheno- 
typic; in another sense, it may be 
genotypic. 

The various positions seen with re- 
gard to the psychological interpreta- 
tion of factors may be thought of in 
terms of preferences for various kinds 
of theoretical concepts. Elsewhere, the 
writer (Coan, 1963) has distinguished 
between adspective concepts, which are 
definable in terms of observed entities, 
events, or relationships, and ultraspective
concepts, whose definitions
contain factual content that exceeds 
the bounds of mere induction from 
past observation. Interpretation of a 
factor in terms of an adspective con- 
cept might consist of avoiding any in- 
ference beyond a present set of load- 
ings and treating the factor simply 
as a function of the observed variables. 
In such a case, one might say that 
interpretation is actually avoided, since 
it is allowed to rest with the product 
of our statistical labors. An ultraspective
concept may take basically
either of two forms—a hypothetical 
concept or a fictional concept. The
hypothetical concept refers to entities, 
events, or relationships which have 
not been observed but which are re- 
garded as potentially observable. The 
fictional concept, on the other hand, 
refers to entities, events, or relation- 
ships which are not assumed to cor- 
respond to fact and which are thus 
not regarded as potentially observable. 

A factor interpretation ordinarily 
takes the form of a hypothetical con- 
cept when the factor is interpreted 
in terms of a particular determinant 
lying outside the range of observation 
of the immediate study. This determi- 
nant might be a biological structure 
or condition, a form of social experi- 
ence, etc. Even if the interpretive 
concept corresponds essentially to 
previously observed entities and 
events, in such cases it will usually 
embody undemonstrated functional re- 
lationships as well. A more common 
form of interpretation consists of equat- 
ing the factor with a general be- 
havioral characteristic—say, an abil- 
ity or a personality trait—that might 
be defined operationally in terms of 
the kinds of performances that have 
been observed. This sort of interpre- 
tation ranges between the adspective 
and the hypothetical. It is clearly 
adspective in the rare instances where 
the tested performances are regarded 
as exhausting the realm of behavior 
in terms of which the concept is de- 
fined. It is still adspective, though 
less clearly so, when the observed be- 
haviors are regarded as representatives 
of classes to which they obviously be- 
long, so that factor interpretation gen- 
eralizes, in effect, to similar batteries, 
and to occasions other than that on 
which observations have been recorded. 
The induction, of course, may or may 
not be supported by subsequent obser- 
vations. The interpretation may be 
considered to take the form of a hypothetical
concept when the alleged general
behavioral characteristic involves
a clear qualitative departure from the 
observed performances—when, for ex- 
ample, the sampled behaviors are 
treated as indirect expressions of a 
more fundamental trait that is truly 
central to the factor. Even here, the 
interpretive concept preserves the 
basic features of the adspective concept 
in so far as it is definable in terms 
of behaviors that have been observed 
at some time. Its hypothetical char- 
acter resides in the implication of a 
pattern of covariation that goes be- 
yond what has been demonstrated. 

Some writers in this field object to 
any identification of a factor with an 
ability or trait, apparently regarding 
such interpretation as a form of reifi- 
cation or hypostatization. Anastasi 
(1938), for example, criticized Thur- 
stone’s treatment of factors in this 
vein, accusing him of resurrecting fac- 
ulty psychology by introducing a sharp 
dichotomy between ability and be- 
havior. Thurstone’s factor labels, how- 
ever, seem largely to constitute ab- 
stract disposition concepts operation- 
ally definable in terms of behavior. In 
formulating such abstractions, Thur- 
stone clearly reveals a preference for 
generalizing further beyond the im- 
mediate data than Anastasi would be 
inclined to do, but this hardly justi- 
fies a charge of hypostatization. 

It is possible that the core of the 
early dispute between Anastasi and 
Thurstone lies simply in a difference 
regarding preferred modes of verbal 
formulation. But it must be granted 
that many theorists do fashion inter- 
pretations in such a way as to imply 
reference to tangible underlying entities, 
and Anastasi is not alone in her concern 
with this problem. Burt (1941), too, 
has expressed alarm at what he per- 
ceives as a widespread tendency to 
reify factors. Thurstone leaves himself
open to this sort of criticism to
the extent that he speaks of factor
analysis as a method for isolating
“psychological entities” or “mental
faculties” (cf. Thurstone, 1935, 1938).
While he regards his faculties as being
fundamental in the sense that they
possess greater cohesiveness and stability
than do many specific forms of
behavior, however, it is nowhere evident
that he regards them as having
an independent existence apart from
the behavior in which they are expressed.
The situation is apparently
comparable for most of the other writers
criticized by Burt.

Of the major factor theorists,
Spearman (1927) appears to come
closest to an interpretation of factors
in terms of entities enjoying a concrete
status not reducible to behavioral
expression. He argues effectively
against equating his g factor with any
particular kind of mental operation,
showing by his data that it appears as
a factor in different kinds of operations.
He argues instead for an interpretation
of g in terms of the
“amount of general mental energy,”
while he interprets his s factors in
terms of the “efficiency of specific
mental engines.” He speculates on
the possibility of interpreting the “engines”
more specifically as different
neural systems, but his primary interest
is in factor g, for which available
information affords little guidance for
formulation of specific physiological
hypotheses. For his purposes, he is
content to leave his mental energy
in the form of a formalized and
unspecified hypothetical concept. For
that matter, he feels that the concept
would effectively serve the purpose of
bringing systematic theoretical order
into the realm of intellectual performance
even if it were denied any definite
physical reality whatever. To
those who insist on such a denial,
Spearman is willing to say that mental
phenomena function as if such an en- 
ergy existed. This concession would 
accord mental energy the status of a 
fictional concept. Psychologists are 
more generally inclined today to see 
heuristic value in fictional physical 
models than they were at the time 
of Spearman's writing, but the fic- 
tional component of the energy concept 
may be too vague to serve any ef- 
fective purpose. Simply as a con- 
struct, of course, the concept serves 
the basic purpose for which Spear- 
man intended it. 

Other writers, such as Thurstone 
(1938, 1948) and Guilford (1940), 
indulge in comparable physiological 
speculation. For both Thurstone and 
Guilford, however, it is clear that a 
factor as psychologically interpreted 
carries no necessary implication of 
biological status. Instead, the physio- 
logical basis underlying the factor is 
treated as a matter of additional hy- 
pothesis. Thus, Guilford suggests that 
some factors will be found to have 
well-defined biological correlates in 
the individual, while others will not. 
In a given factor the physiological 
basis might be localized to a specific 
tissue or it might reside in some 
more general property of the organ- 
ism. Thurstone notes a close parallel 
between his first- and second-order 
factors and Spearman’s s and g and 
suggests, in accordance with Spear- 
man’s formulations, that the primary 
factors may have a relatively local- 
ized basis, whereas the more gen- 
eral second-order factors are likely to 
relate to more central or general bio- 
logical parameters. 

A somewhat different trend in fac- 
tor interpretation is that dominated 
by Burt (1941, 1949), who speaks 
of factors as “principles of classification.”
He objects to the use of
terminology that appears to identify
factors with concrete causes, whether
in terms of “faculties,” “entities,” 
“abilities,” “traits,” or “energy.” He 
insists that the factor does not repre- 
sent a component of the individual. 
Rather it is an “abstract component” 
that provides a convenient frame of 
reference for description. Essentially 
the same view is reiterated by many 
other writers, including Moursey 
(1952), Vernon (1950), and Eysenck 
(1947, 1951). 

The basic position expounded by 
Burt is probably accepted today by the 
vast majority of factor analysts. It 
is to Burt's credit that he has so 
clearly and competently brought the 
issue into focus. The writer is in- 
clined to believe, however, that much 
of what Burt perceives as a differing 
theoretical orientation is simply a pref- 
erence for a different mode of descrip- 
tion. One may speak of faculties, 
primary abilities, and unitary traits 
without attributing to these the funda- 
mental status of anatomical organs. 
Regardless of the descriptive labels 
which Thurstone chooses to employ, 
for example, there can be no question 
but that his basic argument for the 
use of factor methods is a defense of 
the scientific search for useful theo- 
retical constructs, not of a search for 
absolute units (cf. Thurstone, 1947). 

The issue is clouded, of course, be- 
cause of the subtle nature of changes 
in psychological theory. Psychologists 
do not merely reclothe old concepts 
in new guises. They sometimes sub- 
ject old terminology to reconceptuali- 
zation befitting a change in outlook. 
Thus our cellars are stocked with both 
“old wine in new bottles” and “new 
wine in old bottles.” In any case, fac- 
torists continue to identify factors with 
attributes of individuals—traits, abili- 
ties, etc.—but they seem generally to 
regard these attributes as rather arbitrarily
chosen concepts justified by
the needs of scientific theory construction.
A clear methodological difficulty
would appear to arise only when a 
researcher regards his interpreted fac- 
tors as the ultimate objects of sci- 
entific investigation, as fundamental 
discoveries rather than theoretical 
constructions. His concepts then pre- 
sent a barrier to scientific progress, 
but not because they are hypothetical 
or constructive—for hypotheses and 
constructs are essential to scientific 
progress. They fail to serve science 
because they contain a hypothetical 
element that is not subject to confirma- 
tion. A factor interpretation in the 
form of a hypothetical concept is use- 
ful if the entities or relationships to 
which it refers can be examined in 
the light of further research. Simi- 
larly, a fictional concept may be help- 
ful, since its ultraspective component 
is recognized as a convenient construc- 
tion that is subject to replacement. 

In the interest of scientific progress, 
the writer would argue for a free 
and flexible use of different kinds of 
interpretive concepts. Liberal specu- 
lation can best serve its purpose, how- 
ever, if it is tempered with a clear 
recognition of the character and func- 
tion of the concepts. An understand- 
ing of the nature of theory construc- 
tion should obviate unnecessary delay 
in the abandonment of concepts that 
have outlived their usefulness. 

It may be of interest to note that 
beyond the realm of factor theory, 
the issue of hypostatization appears to 
concern theorists recurrently. It prob- 
ably arouses less alarm generally in 
psychology today than it did in the 
1930s and 1940s. Whether there is 
an actual danger in construing noun 
labels as representing “real entities” 
or “immaterial substances,” of course, 
depends on the meaning one attaches 
to these “real entities”—whether one
regards them as having some kind of
permanent or ultimate factual (truth)
status equivalent to that of material
objects. With their increasing sophistication
with respect to the somewhat
arbitrary nature of constructs as
ingredients of theory, psychologists
have become justifiably less preoccupied
with this issue. Perhaps at the
same time, by virtue of this sophistication,
hypostatization has come to
pose a less serious threat.


MULTIPLICITY VERSUS PARSIMONY 


Another issue that has received con- 
siderable attention is that of the num- 
ber of factors or determinants required, 
in theory, to account for the observed 
correlations. In the relevant litera- 
ture, this issue has been thoroughly 
entangled with those in the section
above. Much of the discussion has


theory. Thomson sees practical value
in the products of factor analysis,
since men do perform intellectual tasks
as if only a few factors were operating,
but he maintains that for scientific
purposes we must think of mental
performance as being governed by a
vast multiplicity of determinants.
Psychologically, these might correspond to
“bonds”; physically, they might correspond
to neurones or neural arcs.
Thomson has shown statistically that
a large number of specific determinants
may yield a single factor or a
small number of group factors (though
the relevance of this demonstration to
psychological data has been questioned
by other writers on various grounds).
A similar position has been championed
by Tryon (1935, 1958), who
presents a logical analysis of test performance
in terms of concepts and
adjustments as psychological determinants
and genes as biological determinants.
Tryon admits that external
agencies, in the form of cultural or
educational influences, may function in
such a way as to create correlational 
clustering of psychological components, 
but he insists that in this case the 
broader unity reflected in the resulting 
factor resides in the external agency 
and not within the individual affected 
by it. Tryon objects to what he con- 
siders an improper emphasis on 
parsimony on the part of most factor 
analysts. He feels that if one is 
seeking psychobiological causes, it is 
not appropriate to seek a parsimonious 
set of factors, since the actual causes 
are numerous. Burt (1941) takes a
similar stand on the matter of parsi- 
mony. He believes that simplicity of 
formulation might furnish some 
guarantee of truth in the physical sci- 
ences but that it cannot do so in a 
science like psychology, where the 
subject matter is much more com- 
plex. 
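
Thomson's statistical point can be illustrated with a small simulation. The following is only a minimal sketch under stated assumptions: the number of bonds, the sampling probability, and the function names are hypothetical choices made for illustration and are not taken from Thomson. Each simulated test score sums a random subset of many mutually independent determinants, and the resulting correlations are all positive, with one dominant principal axis.

```python
import numpy as np

def simulate_bond_sampling(n_people=2000, n_bonds=500, n_tests=6,
                           p_sample=0.5, seed=0):
    """Correlations among tests that each sum a random subset of bonds."""
    rng = np.random.default_rng(seed)
    # Each person has many small, mutually independent determinants.
    bonds = rng.standard_normal((n_people, n_bonds))
    # Each test draws on its own random sample of bonds (0/1 membership).
    membership = rng.random((n_bonds, n_tests)) < p_sample
    scores = bonds @ membership.astype(float)
    return np.corrcoef(scores, rowvar=False)

def share_of_first_factor(corr):
    """Proportion of total variance carried by the largest eigenvalue."""
    eigenvalues = np.linalg.eigvalsh(corr)  # ascending order
    return eigenvalues[-1] / eigenvalues.sum()

corr = simulate_bond_sampling()
print(np.round(corr, 2))
print("share of first principal axis:", round(share_of_first_factor(corr), 2))
```

No general determinant is built into the simulated persons; the overlap among the sampled subsets alone produces the positive correlations. Whether psychological data arise in this way is, as noted above, a separate question.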

Both Burt and Tryon make the 
questionable assumption that those who 
advocate parsimonious theory believe 
nature itself to possess the simplicity 
embodied in theory. Yet it seems 
plausible that most factorists today 
would agree essentially with the views 
expressed on this matter by Thurstone 
(1947) and Kelley (1935). Both of 
these men argue, in effect, for a group- 
factor position on the grounds that 
it provides an economical set of con- 
structs. These constructs are viewed 
not as part of an underlying nature 
but as a scheme imposed on nature 
by human beings to serve human purposes.
Both writers recognize that
the greater the number of constructs 
or components adopted, the more ac- 
curately and completely we can repre- 
sent the natural state of affairs, but 
they would also argue that scientific 
theory always simplifies and that the 
degree of simplification we introduce 
must be governed by our needs and 
capacities as human beings to comprehend.


The present issue is, of course, 
bound up with the ancient problem 
of atomism versus holism, even though 
Thomson, for one, insists that he is 
not an atomist. The advocates of 
multiplicity, in any case, seem gener- 
ally to create a rather artificial di- 
chotomy between factual description 
and generalized theoretical statement. 
They assume that there is a certain 
level of description—that of genes, 
neurone arcs, concepts, mental bonds 
—at which our statements will corre- 
spond uniquely to the “truth.” This 
is presumably the level at which 
“things” are really “things”—but are
they? If we were to probe further,
might we not decide that each gene
is but a vast hodgepodge of atomic
particles, that each neurone arc contains
a complex string of semi-independent
components, and that each
emerging concept is analyzable into a
host of momentary awarenesses? It
seems to be increasingly recognized 
by scientists today that even the state- 
ments that appear most objectively 
descriptive represent a good deal of 
abstraction and construction on the 
part of those who issue them. In this, 
they differ only quantitatively from 
highly generalized theoretical formu- 
lations, for scientific statements at any 
level are a product of human perceivers 
and thinkers, and they are created 
to meet a human striving for under- 
standing. 

From this reasoning, it would cer- 
tainly follow that anyone has a per- 
fect right to seek an understanding of 
human behavior and experience on the 
level of neurones, genes, or specific 
mental bonds. These components 
have the same sort of reality as the 
elements of which they are composed 
and the aggregates which they form, 
but they have no unique claim to 
reality. Let us grant that much valu- 
able knowledge may be gained by 
studying anatomy with a microscope,
but let us not deny the existence of
hands and faces on the grounds that 
extensive research with this instrument 
does not reveal them. It is true, as 
Tryon claims, that broad mental fac- 
tors represent somewhat arbitrary se- 
lections, but then any unit of analysis 
represents an arbitrary choice. The 
point is merely a bit more obvious 
in the case of a dimension than in the 
case of an entity. Mental factors may 
be just as meaningful as such arbitrary 
components of physique as height and 
weight. The fact that genetic analysis 
does not reveal them is beside the 
point. 

In the case of both entities and at- 
tribute dimensions, the scientist must 
choose those units or components 
which are optimally useful for a given 
theoretical purpose. Thus, the cell is 
a more useful unit of analysis for most 
physiological problems than, say, the 
chromosome or gene. Similarly, the
most convenient unit for many prob- 
lems of physiological psychology may 
prove to be a fairly complex neural 
network rather than the individual 
neurone. It is appropriate to formu- 
late scientific statements in terms of 
factors, in preference to bonds or 
genes, when our immediate theoretical 
interests lie at the level of description 
to which factors correspond. The 
writer would contend that the theoreti- 
cal concerns of psychologists most 
often—but not always—lie approximately
at the factor level.

One point on which Tryon, Burt, 
Kelley, and Thurstone would all ap- 
parently agree is that the law of parsi- 
mony is a man-made principle rather 
than a law of nature. They would 
agree that its purpose is convenience 
of description, while differing with re- 
gard to the scientific use to be made 
of such description. If we grant the 
general point, we must also grant, to 
be consistent, that no particular form 
of simplicity has any special claim to
truth, and we must concede to Thurstone’s
critics the fact that Thurstone
has not always been this consistent. 

In the realm of factor methods, 
simplicity is subject to a variety of 
interpretations. The most basic one, 
on which factor analysts generally 
agree, is an economy of constructs.
We seek to account for a given num- 
ber of variables in terms of a smaller 
number of factors. Once we have 
agreed on the number of factors to 
be employed, we note that from a 
mathematical standpoint there is an 
infinite number of sets of factors that 
will account equally well for our data. 
Of these innumerable sets, Thurstone 
would argue that the one best set is 
that which best satisfies his criteria 
for “simple structure.” Unfortunately, 
Thurstone (1938) has sometimes 
spoken as if his criteria for simple 
structure constituted some kind of 
fundamental discovery. In the writ- 
er's opinion, they must be considered 
an arbitrary set of rules, though if the 
individual investigator is permitted to 
use them elastically, they furnish the 
most generally satisfactory basis for 
choosing a rotational solution that any- 
one has yet devised. 
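
The rotational indeterminacy described above can be checked numerically. The sketch below assumes two orthogonal factors and a hypothetical loading matrix invented for illustration; it shows that any orthogonal rotation of the loadings reproduces exactly the same common-factor correlations, so the data alone cannot choose among the rotated sets.

```python
import numpy as np

def rotate(loadings, angle):
    """Apply an orthogonal rotation (two factors) to a loading matrix."""
    c, s = np.cos(angle), np.sin(angle)
    rotation = np.array([[c, -s], [s, c]])
    return loadings @ rotation

# Hypothetical loadings of five variables on two orthogonal factors.
loadings = np.array([
    [0.8, 0.1],
    [0.7, 0.2],
    [0.6, 0.5],
    [0.2, 0.7],
    [0.1, 0.8],
])

reproduced = loadings @ loadings.T
for angle in (0.3, 1.0, 2.5):
    rotated = rotate(loadings, angle)
    # Every rotated set reproduces the same common-factor correlations.
    assert np.allclose(rotated @ rotated.T, reproduced)
```

A criterion such as simple structure is therefore a rule for choosing among equally adequate solutions, not a consequence of the data.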

It is widely recognized that the 
criteria for simple structure can be 
met more readily if obliqueness among 
rotated factors is permitted, but 
obliqueness itself constitutes an ad- 
ditional form of complexity. The 
writer (Coan, 1959) has previously 
argued that, on the whole, oblique 
solutions are superior to orthogonal 
ones, but it must be emphasized that 
the two types of solutions furnish 
incommensurable forms of simplicity 
—factor simplicity and model sim- 
plicity. In such a choice between 
construct models, neither alternative is 
inherently more valid than the other. 
For that matter, one might even argue, 
in line with Burt’s (1941) reasoning,
that an unrotated least-squares solution
has its own special claim to simplicity.
It is this solution that enables
us to account maximally for the ob- 
served variance with any possible 
number out of a larger number of ex- 
tractable factors. For purposes of 
theory construction, however, the un- 
rotated solution is difficult to defend. 
It is too dependent on the accidents 
of variable selection to yield the necessary
reproducibility from one study to
another. 
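
The special claim of the unrotated solution can be illustrated as follows. This is a sketch under assumptions: it uses a principal-axis solution with unities in the diagonal (that is, principal components) on arbitrary simulated data, and the helper names are hypothetical. For any number k of factors taken from the retained set, the unrotated factors account for at least as much variance as any k factors from a rotated version of the same set, although the total is unchanged by rotation.

```python
import numpy as np

def principal_loadings(corr, n_factors):
    """Unrotated principal-axis loadings (unities in the diagonal)."""
    eigenvalues, eigenvectors = np.linalg.eigh(corr)
    order = np.argsort(eigenvalues)[::-1][:n_factors]
    return eigenvectors[:, order] * np.sqrt(eigenvalues[order])

def variance_by_factor(loadings):
    """Sum of squared loadings for each factor (column)."""
    return (loadings ** 2).sum(axis=0)

rng = np.random.default_rng(1)
data = rng.standard_normal((300, 6)) @ rng.standard_normal((6, 6))
corr = np.corrcoef(data, rowvar=False)

unrotated = principal_loadings(corr, n_factors=3)
# A random orthogonal rotation of the same three factors.
rotation, _ = np.linalg.qr(rng.standard_normal((3, 3)))
rotated = unrotated @ rotation

print(variance_by_factor(unrotated))  # concentrated in the leading factors
print(variance_by_factor(rotated))    # same total, distributed differently
```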

The folly in attributing ultimate 
truth to any one type of solution is 
underscored by the fact that our earlier 
choice of number of factors is in- 
evitably arbitrary in any empirical 
problem. This decision depends pri- 
marily on the importance that we at- 
tach, respectively, to precision and sim- 
plicity of description, but it is 
complicated by the obvious fact that 
simplicity with respect to number of 
factors is inversely related to the 
attainable simplicity of individual fac- 
tors in a rotated solution. In any 
case, this decision delimits the possible 
redistributions of variance through 
subsequent rotation. In the writer’s
opinion, we can usually best meet our
theoretical needs by employing a lib- 
eral number of factors thoroughly ro- 
tated to an oblique solution. 

In the writings of Thurstone and 
his followers, the desiderata of factor 
constructs are often spoken of in terms 
of “functional unity.” To some audiences,
to speak of a factor as representing
a functional unity is to imply
that it represents some underlying indivisible
element. Thurstone, however,
has long made it clear that the
factors or “faculties” he seeks do not
constitute ultimates and that they may
be broken down into further elements
(cf. Thurstone, 1938). In this, he
agrees with those who regard factors
as “principles of classification” subject 
to any desired degree of subdivision 
(cf. Burt, 1949; Moursey, 1952).
In essence, to say that a factor repre- 
sents a functional unity is simply to say 
that it represents a reproducible pat- 
tern of covariation. 

The matter needs to be examined 
further, however, because the concept 
of functional unity is often intended 
to imply some ill-defined cohesiveness 
of variables underlying this covaria- 
tion. We must bear in mind that R- 
technique correlations and the factors 
that we derive from them represent 
directly correlational trends running 
through a group of subjects. It is 
important that we entertain the ques- 
tion of the extent to which we can 
properly infer from this covariation a 
functional interdependence of variables 
within individuals. In proposing a dis- 
tinction between object factors and 
population factors, the writer (Coan, 
1959) has previously suggested that 
the inference will prove to be justified 
in some cases but not in others. The 
issue is related to that raised long 
ago by Allport (1937), who said that 
factors represent only “average tend- 
encies.” In speaking of the “statisti- 
cal” establishment of traits, however, 
even Allport (1931) appears to as- 
sume that the kind of inference in- 
volved here is warranted, at least up 
to a point. Moderate support for the 
inference has been furnished in recent 
years by the partial correspondence of 
R-technique and P-technique factors 
derived from a common measurement 
realm. Be that as it may, the question 
is still with us, and it deserves closer 
attention if we are going to build 
psychological theories on a correla- 
tional foundation. On logical grounds, 
we can at best argue for the plausi- 
bility of an inference of dynamic inter- 
dependence. We obviously need more 
in the way of direct empirical investi- 
gation of the matter. 
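
The gap between covariation across persons and interdependence within persons can be made concrete with a simulation. The sketch below is purely illustrative and rests on hypothetical assumptions: a stable person-level trait raises two measures, while their occasion-to-occasion fluctuations are independent. Correlating across persons (in the manner of R-technique) then yields a substantial coefficient, whereas correlating across occasions within each person (in the manner of P-technique) yields coefficients near zero.

```python
import numpy as np

rng = np.random.default_rng(2)
n_people, n_occasions = 200, 100

# Hypothetical stable person-level trait that raises both measures.
trait = rng.standard_normal(n_people)
# Occasion-to-occasion fluctuations are independent for the two measures.
x = trait[:, None] + rng.standard_normal((n_people, n_occasions))
y = trait[:, None] + rng.standard_normal((n_people, n_occasions))

# R-technique analogue: correlate across persons on a single occasion.
r_between = np.corrcoef(x[:, 0], y[:, 0])[0, 1]
# P-technique analogue: correlate across occasions within each person.
r_within = np.array([np.corrcoef(x[i], y[i])[0, 1] for i in range(n_people)])

print(round(r_between, 2), round(r_within.mean(), 2))
```

In this constructed case the group correlation reflects only differences among persons, so an inference to functional interdependence within the individual would be unwarranted. Real data may of course behave differently, which is why the matter calls for empirical study.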

In future explorations of this prob- 
lem, we should bear in mind that apart 
from its fundamental meaning in terms
of transpersonal covariation, functional
unity may imply any of several dif- 
ferent kinds of interrelationships. For 
one thing, it may imply a high level 
of causal interaction among elements, 
each directly affecting the others to a 
greater extent than it affects elements 
forming other factors. A second pos- 
sibility is that constituent elements of 
the functionally unitary trend have a 
common source in a unified matura- 
tional process or a unified deter- 
mining environmental context. Certain 
learned performances (e.g., that dis- 
played in a vocabulary test) clearly 
illustrate this kind of unity. A third 
possibility is that the constituent ele- 
ments display a common behavioral 
effect, or at least a convergence of 
effect, which tends to reduce their 
mensural separability. Whatever the 
multiplicity of influences, anatomical 
organs and physical dimensions dis- 
play this sort of unity. Obviously 
some psychological or behavioral fac- 
tors may function in the same way. 
These three conditions are neither 
mutually exclusive nor necessarily con- 
comitant. It is almost inevitable that, 
in forming psychological interpreta- 
tions of factors, we shall assume some 
kind of functional unity beyond that 
directly demonstrated by group cor- 
relation data. If we are to make the 
most precisely meaningful interpreta- 
tions and formulate the most appropri- 
ate hypotheses, we must remember 
that “functional unity” is not itself a 
unitary concept. 


THE CAUSAL NATURE OF FACTORS


Those who object to the “reifica- 
tion” of factors often express concern 
over the possibility that factors will 
be interpreted as causal entities. Thus, 
warnings against causal interpretation 
have been uttered by Thomson (1935), 
Anastasi (1938), Eysenck (1947), 
and Burt (1941, 1949). Burt (1941)
does grant that factor analysis may
yield products corresponding to causal 
entities, though the process of factori- 
zation does not guarantee this. Its 
immediate products, in any case, are 
phenotypes rather than genotypes. 
Burt further argues that explanation 
in terms of causal agencies is, after 
all, an outmoded approach which has 
no place in twentieth century science. 

The counter view has long been ad- 
vocated by Cattell (1957), whose in- 
terpretation of the factor as a source 
trait is deliberately intended to imply 
that the factor operates as a cause 
with respect to the observed “sur- 
face” correlations. Even in those oc- 
casional places where he makes a dis- 
tinction between factor and source 
traits and speaks of the source trait 
as the “genotype of which the factor 
is the phenotype,” there is still im- 
plicit the notion that the factor corre- 
sponds rather closely to a cause, in 
any case more closely than the corre- 
lations from which it is mathematically 
derived. Cattell assumes that the 
cause may take any of a variety of 
possible concrete forms. It may be 
something physical, physiological, 
social, or psychological in nature. 
Though the issues involved in the 
controversy between Burt and Cattell 
are basically more logical than factual, 
Cattell has recently sought to provide 
empirical support for his position (Cat- 
tell & Sullivan, 1962). 

It has already been suggested that 
one source of difficulty is the elastic 
usage of the word factor. A more 
serious problem in the present contro- 
versy, however, is the failure of those 
who take part in it to agree on a 
common concept of causality. The 
word cause has long been used in a 
variety of ways, and despite a peren- 
nial concern with the matter extending 
far back into antiquity, no standard- 
ized usage has ever been adopted in 
philosophy or science. The concept
displays two general properties which
are fairly constant. First, it refers to 
something which is somehow function- 
ally related to an effect. Second, 
whether we speak of a cause or of the 
cause, we perform an arbitrary selec- 
tion of some ingredient or aspect out 
of a more inclusive determining field 
of interaction. However the cause is 
designated, it constitutes but one of 
innumerable possible abstractions that 
might be treated similarly. 

A causal concept may embody any 
of several possible kinds of functional 
relationships. Thus, a “cause” may
constitute any of the following: (a)
an antecedent event (fire or the act
of igniting as a cause of smoke), (b)
a disposition property defined in terms
of, and manifested in, the effect (gravity
as a cause of falling, the magnetic
property of an iron bar as a cause
of the movement of iron filings), (c)
a whole of which the effect is a part
or a specific case (A’s greater general
intelligence as a cause of his solving
Problem X more rapidly than B does),
(d) a part of the whole manifesting
the effect (helium as a cause of the
dirigible’s rising), (e) a characteristic
of the system in which the effect occurs.

These five categories are neither ex- 
haustive nor mutually exclusive. The 
first refers to causes in a genetic or 
historical sense. The antecedent event 
may be one which precedes the effect 
either invariably or with sufficient re- 
liability to permit better than chance 
prediction from cause to effect or
from effect to cause. The remaining
categories represent ahistorical interpretations
of cause. Of these, the
last is the most general, for it obvi- 
ously overlaps with b, c, and d. Cate- 
gory b offers the most clearly re- 
dundant form of explanation, except 
in so far as the cause embraces a 
greater variety of events than the 
effect explained—in which case the
cause is also of Type c. In a broad
sense, perhaps all causal explanation 
is ultimately redundant. 

Those who speak of factors as 
causes generally do so in a broad 
ahistorical sense, adopting most clearly 
Interpretation e but making varying 
use of the other interpretations. Thus 
the factor, as a basic characteristic of 
the behaving organism, is regarded as 
a constituent of the contemporary field 
in which a given behavioral effect oc- 
curs (e). At the same time, it may 
be defined in terms of a predisposition 
to the behavioral manifestations con- 
stituting its effects (b), but viewed 
as possessing greater generality than 
any given effect so that it is subject 
to various forms of expression (c). 
While not defined in terms of ante- 
cedents, it is commonly assumed to be 
traceable to earlier biological or ex- 
periential sources and to have a 
longer prior history than does its 
immediate effect. It is this temporal 
generality, or stability, of the factor 
that makes the factor concept a tool 
for prediction as well as for general- 
ized description. 

The notion of generality, both in a 
temporal sense and with respect to 
diversity of effect, is evidently basic 
to Cattell’s concept of cause. After 
a careful logical analysis of the re- 
lationships among loaded variables, 
primary factors, and second-order fac- 
tors, Cattell (1957) concludes that 
specific variables must be regarded as 
consequences of factors and factors as 
causes of variables. For the same 
reason, second-order factors must be 
regarded as playing a causal role in 
relation to primary factors. These
conclusions follow from the facts that
Cattell considers only if one adopts the
premise that effects are more varied
than underlying influences.

Rather similar reasoning appears in
a more recent work by Guilford
(1959a), who tends to avoid causal
terminology. Adopting a hierarchical
conception of behavioral organization,
Guilford says that “determination”
within the hierarchy tends to operate
in a downward direction, i.e., from
the more general to the more specific
characteristics. He believes also that
prediction works most efficiently in
the same direction.

Those who display greatest alarm
at references to causes seem concerned
lest we postulate some invisible motor
within the machine. Yet basically, the
arguments in favor of factors as causes
rest on the idea that general characteristics
in some way determine specific
acts. A “cause” in this sense is a
legitimate construct or abstraction as
long as we make our usage clear.
It is justifiable to speak of the whole
as determining the part in so far as
we find it convenient to order our
thinking in this way, but we must not
pretend that we are stating a fundamental
law of nature. If this ordering
leads to better prediction than its
opposite, it is largely because deduction
is more efficient than induction.
Undoubtedly, many psychological
problems relating to “underlying influences”
and “depth,” both inside and
outside the realm of factor theory, are
operationally reducible to problems of
generality versus specificity. No
doubt, some confusion might be prevented
if we avoided the use of such
terms as cause, depth, and genotype
in referring to general characteristics.
The use of the word cause in respect
to factors has led to a rather sterile
controversy.

1 It should be noted that the distinction
between first- and second-order factors is
an arbitrary ingredient of the factor model.
As Schmid and Leiman (1957) among
others have shown, the second-order factor
is subject to mathematically equivalent
representation as a broad factor at the
first-order level. To apply his reasoning
consistently to the empirical realm, Cattell
would have to argue that under some
conditions a broad factor will bear a causal
relationship to a narrower, linearly
independent but cooperative factor of the same
order.
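
The equivalence attributed to Schmid and Leiman (1957) in the footnote can be verified on a small constructed case. The sketch below uses hypothetical loadings chosen only for illustration: two correlated primary factors with one second-order factor are re-expressed as one broad factor plus two residualized group factors, all orthogonal and all at the first-order level, and both representations reproduce the same common-factor correlations.

```python
import numpy as np

# Hypothetical oblique first-order pattern: six variables, two factors.
first_order = np.array([
    [0.8, 0.0], [0.7, 0.0], [0.6, 0.0],
    [0.0, 0.8], [0.0, 0.7], [0.0, 0.6],
])
# Hypothetical loadings of the two primaries on one second-order factor.
second_order = np.array([[0.7], [0.7]])
uniqueness = np.diag(np.sqrt(1.0 - (second_order ** 2).ravel()))

# Correlation between the primaries implied by the second-order model.
phi = second_order @ second_order.T + uniqueness ** 2

# Orthogonalized form: one broad factor plus residualized group factors,
# all mutually orthogonal and all at the first-order level.
broad = first_order @ second_order
groups = first_order @ uniqueness
orthogonalized = np.hstack([broad, groups])

hierarchical = first_order @ phi @ first_order.T
flat = orthogonalized @ orthogonalized.T
assert np.allclose(hierarchical, flat)
```

Since the two forms are mathematically interchangeable, nothing in the analysis itself marks the broad factor as standing at a different "level" from the narrower ones.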


FACTOR GENERALITY AND LEVELS
OF DESCRIPTION


There are a number of other prob- 
lems that relate to the generality of 
factors and description. It is an obvious
fact that factors constitute a
more general descriptive level than
the variables from which they are
derived, and it is equally obvious that
within the realm of factors and within
the realm of experimental variables
there is a range of variation in generality.
The possible relationship between
these descriptive levels and
levels of organization in the mind or
behavioral system has long occupied
the interests of various theorists. 

Most treatments of this matter are 
outgrowths of the hierarchical view- 
point formulated by Burt (1941, 
1949). Burt himself traces his view 
of mental organization to Herbert 
Spencer. Burt distinguishes four 
types of characteristics corresponding 
to the four possible types of factors 
commonly called general, group, specific,
and error. He regards the four-factor
theorem as fundamental, inasmuch
as an adequate conception of
the mind requires recognition of characteristics
on all levels from the universal
to the accidental.

A number of other writers—including
Vernon (1950), Eysenck (1951),
Moursey (1952), and Guilford (1959a,
1961)—have adopted Burt’s model.
These men all accept a scheme involving
four levels, though they vary
somewhat in the psychological identities
attributed to the levels. A system
of four levels represents a rather clean
set of logical distinctions when one is
distinguishing types of factors or possible
degrees of distribution of characteristics.
As we move from types
of factors to corresponding levels of 
mental or behavioral organization, 
however, adherence to a fourfold 
scheme becomes much more clearly 
arbitrary. This should be obvious 
from the mere fact that a factor may 
change from one level to another de- 
pending on the test battery. Of course, 
such facts are generally recognized. 
Though Moursey attempts to make an 
empirical test of the hierarchical model, 
he notes the artificial simplicity of 
the model and insists that only an ap- 
proximate correspondence between the 
model and actual mental organization 
is to be expected. Guilford appar- 
ently makes similar assumptions. 
Eysenck stresses the formal, abstract 
character of the model and the rela- 
tivity of distinctions in factor level. 
Both Eysenck and Guilford are pri- 
marily concerned with personality 
structure, and they both recognize essentially
the same levels of behavioral
organization—those of specific responses,
habitual responses (in Guilford’s
terms, the hexis level), traits,
and types. Both men recognize, too, 
that obtained factors do not bear any 
fixed relationship to these four levels. 
It should be granted that the sort of 
pyramidal hierarchy espoused by these
writers affords a convenient frame- 
work in terms of which we can relate 
aspects of personality differing in gen- 
erality of effect. For such a purpose, 
the four levels constitute justifiable 
constructs. A basic problem for which
there is still no completely adequate 
solution, however, is that of deciding 
how to make appropriate inferences 
from empirical factors to these levels 
of behavioral organization. Early fix- 
ation on a fourfold scheme of stratifi- 
cation might well serve to obscure 
some of the underlying issues and
interfere with progress in their solution.
Perhaps we ought to give more
explicit recognition to a basic continuum
of generality and speak of degree
of generality rather than level of
organization. We shall then be better 
prepared to cope with a problem that 
involves, on the one hand, a variable 
spread of generality among factors, 
among experimental variables, and be- 
tween factors and variables, and on 
the other hand, a fluid correspondence 
of these factors and variables to be- 
havioral structures. 

To see more clearly the nature of 
the problem, let us first distinguish 
clearly between contextual generality 
and referent generality. As used here, 
contextual generality refers to the ex- 
tent to which a factor loads the ex- 
perimental variables in a given set of 
data—i.e., it refers to generality 
within a given mathematical context, 
without regard to the psychological 
realm represented by the data. Thus 
in a given study, a so-called general 
factor would possess greater contextual 
generality than a group factor, and 
a second-order factor would ordinarily 
possess more than a first-order factor. 
For factors of all orders, contextual 
generality may best be expressed 
mathematically in terms of the total 
variance accounted for within the ex- 
perimental variables. Referent gen- 
erality, on the other hand, would refer 
to the scope of reference of the variable
or factor in the behavioral or
mental system in terms of which it is 
psychologically interpreted—i.e., the 
variety of behaviors or mental activi- 
ties to which it relates and the degree 
to which it relates to them. 
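
As a rough illustration of contextual generality, the following sketch computes the proportion of total variance that each factor accounts for within a set of variables. It assumes standardized variables and orthogonal factors, so that the figure reduces to the mean squared loading per factor; the function name and the loading values are hypothetical and are not taken from any study cited here.

```python
def contextual_generality(loadings):
    """Return, for each factor (column), the proportion of the total
    variance of the variables (rows) that the factor accounts for.

    Assumes standardized variables and orthogonal factors, so the value
    is the mean of the squared loadings in that column.
    """
    n_vars = len(loadings)
    n_factors = len(loadings[0])
    return [
        sum(row[j] ** 2 for row in loadings) / n_vars
        for j in range(n_factors)
    ]


# Hypothetical loadings: four variables on two orthogonal factors.
# The first column behaves like a general factor, the second like a
# group factor that loads only the last two variables appreciably.
example_loadings = [
    [0.8, 0.1],
    [0.7, 0.0],
    [0.6, 0.5],
    [0.5, 0.6],
]

general, group = contextual_generality(example_loadings)
print(f"general factor: {general:.3f}, group factor: {group:.3f}")
```

Nothing in such a computation bears on referent generality, which depends on the psychological interpretation of the variables rather than on the pattern of loadings.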

There is an imprecise correspond- 
ence between contextual generality 
and referent generality. Within a 
given set of factors, a second-order 
factor will probably possess greater 
referent generality than a primary fac- 
tor, but a second-order factor in one 
study may possess less referent gener- 
ality than a primary factor in another 
study. Furthermore, two first-order
factors in a given set, with comparable
loading patterns, may differ greatly in 
referent generality, depending on the 
variables loaded. The experimental 
variables themselves have equal 
mathematical status in a given study 
and are, hence, regarded as identical 
in contextual generality, but they may 
vary enormously in referent general- 
ity. 
Among the conditions relevant to the 
two kinds of generality, we must rec- 
ognize all of the following: 


1. Selection of subjects: As Vernon 
(1950) has noted, both the degree and 
type of selection may affect the factor 
pattern. If a factor remains recog- 
nizably the same as we go from a 
relatively unrestricted sample to a 
sample restricted on relevant vari- 
ables, the loadings decrease in magni- 
tude. In short, its contextual gen- 
erality tends to decline, though refer- 
ent generality may remain unchanged. 
Vernon argues that a high degree of 
selection favors the emergence of 
group factors in place of a general 
factor. 

2. Number of factors extracted 
from a given set of variables: Suc- 
cessive centroid and principal-axis 
factors contain progressively smaller 
variance, and subsequent rotation 
tends to distribute the available vari- 
ance more evenly. Hence, each ro- 
tated factor within a large set will 
tend to be of smaller variance and 
possess both less contextual general- 
ity and less referent generality than a 
factor within a smaller set derived 
from the same variables. 

3. Kind of rotational solution:
Oblique rotation tends to redistribute 
variance more evenly than does orthog- 
onal rotation. Hence, orthogonal fac- 
tors tend to display a greater range 
of contextual generality, and corre- 
spondingly of referent generality, some 
being more general and some less general
than any of the oblique factors.
This trend is illustrated by a previous 
study of the writer (Coan, 1959). 

4. Referent generality of scoring 
composite: The factor, of course, will 
display higher referent generality than 
the scored variables from which it is 
derived (unless it is a “singlet” fac- 
tor), but the higher the referent gen- 
erality of the variables, the higher will 
be that of the factor, whatever its 
contextual generality. The discrepancy 
between the factor and the variable 
in degree of referent generality will, 
of course, be determined by the other 
conditions treated in this list. The 
referent generality of the scored vari- 
able will be a function both of the 
basic measurement operation (rating 
of general traits, recording of specific 
movements, etc.) and of the mode of 
combination of scoring units (items) 
into the composite that constitutes a
correlatable score. 

5. Density of variable sampling: 
Vernon (1950) has noted that one 
may increase the contextual generality 
of a factor substantially by specializ- 
ing the battery. In so far as the 
psychological identity of the factor is
not altered in the process, referent 
generality would remain unchanged. 
Density of sampling, however, also
determines what factors will emerge. 
We may expect relatively dense sampling
within a given behavioral domain
to yield factors not too far above
the variables in degree of referent 
generality. Sparse sampling is conducive
to the direct emergence of
factors that would be of a higher order
if dense sampling had been used.


Fig. 1. A modified hierarchical representation of mental and behavioral organization.

A consideration of the above points
suggests that, in place of a four-level
hierarchy, we might better conceptualize
the psychological implications of
factors in terms of the scheme shown
in Figure 1. In this figure, the vertical
dimension represents a continuum
of referent generality. The circles
represent experimental variables, and 
the squares and triangles represent, 
respectively, first- and second-order 
factors derived from them. The elements
thus shown in the figure may
be construed as compiled from a va- 
riety of studies, the dotted lines indi- 
cating demonstrated loadings. Hori- 
zontal spacing may be regarded as an 
inverse function of correlational affin- 
ity (and perhaps of a corresponding 
relatedness within the individual), 
though if we wished to fit actual empirical
data to such a figure, we should
have to regard any horizontal array 
as the trace of a multidimensional con- 
figuration in which proximity faith- 
fully represented correlation (by any 
convenient geometric convention). 

As the figure is intended to suggest, 
variables, primary factors, and second-order
factors may occur at all elevations
of referent generality. Within
a single study, their vertical succession
is essentially orderly, but vertical
discrepancies within the context of a
single study are subject to variation.


Factors B and C illustrate the effects 
of dense and sparse sampling of vari- 
ables, while the difference between B 
and D is of the sort that might be 
governed by decisions regarding the 
number of factors to extract or the 
choice of rotational solution. 

It is well known that different in- 
vestigators in the personality field se- 
lect variables of different degrees of 
referent generality and extract factors 
of different degrees of generality. The 
factors of Eysenck, for example, are 
obviously of higher referent general- 
ity than those of Cattell—largely be- 
cause he extracts far fewer factors, 
but to some extent because of differ- 
ences in method of rotation and vari- 
able selection. Within Cattell’s work, 
there is probably considerable general- 
ity range in variable selection. Most 
notably, there seems to be a tendency 
for his objective test variables to be 
of higher referent generality than his 
rating and questionnaire variables, 
with the result that primary objective
test factors tend to align with second-order
rating and questionnaire factors,
as in the case of Factors A and I in the
diagram. The discrepancy between 
measurement realms may also be at- 
tributable, in part, to denser variable 
sampling in the questionnaire and rat- 
ing realms (which would lead, inci- 
dentally, to a greater obliqueness of 
primaries and clearer definition of sec- 
ond-order factors in these realms). A 
given number of objective test vari- 
ables are likely to encompass a greater 
range of behaviors than an equal num- 
ber of questionnaire items. 

It also seems plausible that Cattell’s 
questionnaire and rating variables, in 
contrast to his objective test variables, 
are at a relatively uniform elevation in 
the continuum of referent generality 
and that they yield factors which, 
correspondingly, are fairly uniform. 
The objective-test variables undoubt- 
edly cover a wide range of referent 
generality, and the resulting factors 
probably display a comparable verti- 
cal spread. Thus, Factor U. I. (T) 
24 (Anxiety), which matches a sec- 
ond-order questionnaire factor, dis- 
plays high loadings for a set of rather 
generalized verbal measures (cf. Cat- 
tell, 1957). In contrast, Factor U. I. 
(T) 22 (Corticalertia) loads a va- 
riety of simple tasks involving alert- 
ness and speed. It seems likely that 
this factor is of lower referent gen- 
erality than U. I. (T) 24, unless we 
can assume that it draws from a suffi- 
ciently wide range of variables so 
that it is of considerably higher ref- 
erent generality than the variables that 
it loads. 

As Guilford (1959a, 1961) has 
pointed out, there is a danger of con- 
fusion in studies involving a mixture 
of variables and of factors of different 
degrees of generality. Guilford sug- 
gests, as a remedial strategy, that we 
ought ideally to start at the lower 
levels and work our way upward. 
Since no one has yet followed such
a plan consistently, it is difficult to
say how practical it would prove.
Perhaps it would prove more feasible 
for some areas of behavior than for 
others. In any case, we are 
faced with the problem of determining 
more objectively the measurement 
level of whatever variables we consider
and obviating the sort of guesswork
in which the writer has just
indulged. 

Assuming that our researches 
ultimately cover the entire continuum 
of referent generality, perhaps what 
we need in any single investigation is
not so much low generality sampling 
of variables as uniform generality and 
uniform density sampling. The aim 
of such sampling, of course, would 
be to obtain factors of relatively uni- 
form generality. The desired gen- 
erality of either variables or factors 
may best be determined by our immediate
needs for scientific description.
It is unlikely that a sound procedure
for satisfactory sampling can
be prescribed at this time, and the 
problem is likely to require a consider- 
able amount of exploratory work with 
all sorts of measurement variables. 
A possible solution might lie in se- 
lecting variables out of a much larger 
potential set on the basis of their 
patterns of correlation. For example, 
if an initial large sample of variables 
were designed to cover very intensively 
a certain band within the hierarchical 
system for a given realm of observation,
we might select variables for
factorization on the basis of the dif- 
fusiveness of their correlations with 
other variables. The hierarchical model 
assumes implicitly that the higher the 
level of generality, the more the vari- 
able will tend to display moderate 
correlation with many other variables 
and very high correlation with few 
other variables. The major obstacle 
in the way of applying such a procedure
is the difficulty of satisfying
other requisite assumptions regarding
the nature of initial sampling and the
uniformity of mode of organization 
within the total system. 
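
The selection idea described above can be sketched in a few lines. The function below counts, for one variable, how many of its correlations with the other variables are moderate and how many are very high. The cutoffs (0.2 to 0.6 for "moderate," 0.8 for "very high"), the function name, and the correlation matrix are illustrative assumptions only; the text does not specify any such values.

```python
def diffuseness(corr, index, moderate=(0.2, 0.6), very_high=0.8):
    """Count the moderate and the very high correlations of one variable
    with all other variables in a correlation matrix.

    The thresholds are illustrative assumptions, not established cutoffs.
    """
    low, high = moderate
    others = [abs(r) for j, r in enumerate(corr[index]) if j != index]
    n_moderate = sum(1 for r in others if low <= r < high)
    n_very_high = sum(1 for r in others if r >= very_high)
    return n_moderate, n_very_high


# Hypothetical correlations among four variables.
corr = [
    [1.0, 0.4, 0.3, 0.5],
    [0.4, 1.0, 0.9, 0.1],
    [0.3, 0.9, 1.0, 0.1],
    [0.5, 0.1, 0.1, 1.0],
]

for i in range(len(corr)):
    print(i, diffuseness(corr, i))
```

Under the hierarchical assumption, the first variable (three moderate correlations, none very high) would be a candidate for a higher level of generality than the second (one moderate, one very high).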

Perhaps further exploration will 
suggest entirely different modes of 
attack on the whole problem. Cattell 
has long been an advocate of compre- 
hensive sampling of variables within 
any realm one chooses to study, and 
the advent of computers has helped 
to make this aim appear reasonably at- 
tainable. In terms of our hierarchical 
model, we can now handle samples of 
great horizontal scope. Perhaps in 
this age of computers we can also find 
a basis for refining our selection in 
terms of density and generality. A
methodological and empirical founda- 
tion will then have been laid for con- 
structing an integrated picture of hu- 
man behavior and experience that will 
encompass both the most general ten- 
dencies and the most specific actions. 

It will probably occur to some read- 
ers that Guttman’s facet theory (cf. 
Guttman, 1954-55, 1958) provides the 
basis for a different approach to the 
problem of uniform generality sam- 
pling—particularly since Humphreys 
(1962) has recently advocated the 
facet model as a “more general way 
of thinking about psychological tests” 
than the hierarchical model. Hum- 
phreys argues that the hierarchical 
model breaks down in the case of the 
multitrait-multimethod matrix of 
Campbell and Fiske. The hierarchical 
model “breaks down” in a multifacet 
system, of course, only in the sense 
that the subordination pattern within 
the hierarchy becomes complex. Fur- 
thermore, one may question whether 
the facet model is really an alternative 
to the hierarchical model, since it seems 
to serve a different purpose and to rest 
on a different logical basis. Facet 
theory is based on a logical analysis 
of test structure, whereas the hier- 
archical model (as conceived by Burt) 
rests on a logical analysis of trait distributions
(more precisely, in its implementation,
the distribution of traits
over a sample or population of be- 
haviors). 

Facet theory affords a basis for its 
own species of hierarchy, one in which 
levels correspond to orders of inter- 
action between the facets of a design. 
Such a hierarchy is valuable in its own 
right, but the kind of generality that 
is thus represented is logically distinct 
from both contextual generality and 
referent generality. Even when test 
sampling is deliberately “heterog- 
enized” with respect to facet combi- 
nations in a given research, there is 
nothing to guarantee a close corre- 
spondence between facet interaction 
level and the other kinds of generality. 
Guilford’s (1956, 1959b) recent classi- 
fication of intellectual factors may be 
construed as representing an alignment 
at one interaction level of factors that 
are hopefully of relatively uniform ref- 
erent generality. We may well find in 
future research, however, that the 
structure that proves most empirically 
meaningful at higher levels of referent 
generality in the realm of intellectual 
performance bears little resemblance 
to anything suggested by the facet 
model.

Before dismissing facet theory as 
irrelevant to our purposes, we should 
recognize the advantages it would 
offer if referent generality could some- 
how be linked to the facet model. 
If and where such a correspondence 
can be established, the facet model 
would provide a basis for identifying 
referent generality independently or 
semi-independently of contextual gen- 
erality, permitting allowances for the 
peculiarities of variance distributions 
in specific sets of data. Such an 
eventuality may depend on the de- 
velopment of appropriate techniques 
for identifying facets and elements 
within them. Basically, the problem 
of uniform generality sampling remains
an empirical problem which can
best be conceptualized in terms of a 
modified hierarchical model. Hum- 
phreys, however, has performed a 
valuable service in suggesting the relevance
of facet theory to this model.


A METHOD OF ANALYSIS AND CLASSIFICATION
OF REPETITIVE RESPONSE SYSTEMS*
ROBERT B. LOCKARD 


University of Washington 


Many animal response systems are characterized by a stable rate 
component. For example, daily caloric intake is stable in a stable 
environment. Many such response systems supply reservoirs (e.g.,
drinking) with a characteristic drain rate. Thus a “theoretically 
perfect” reservoir system would accumulate a deficit when denied 
inflow (e.g., denied drinking behavior) but would exactly compensate 
for the accumulated deficit when inflow was again made possible. 
This basic model is extended to all repetitive response systems, in- 
cluding those without known reservoirs, and those which do not 
compensate for deprivation. Thus a quantitative taxonomy of diverse
behaviors becomes possible.


It is not the purpose of this paper to 
propose a method for the analysis of 
learning curves. These, broadly speak- 
ing, pertain to response systems in 
transitional states as a consequence of 
a change in the functional properties 
of the environment. The class of 
phenomena to which this paper applies 
is stabilized response systems operating 
in a constant environment. Some of 
these systems are so regular with respect
to response frequency per unit
of time that rates are easily computed 
and are useful as dependent variables 
or as indicators of other events. For 
example, we speak of heart rate and 
respiratory rate. Certain other response
systems, such as activity-wheel
running rate, vocalization rate, and the
rate of other striped muscle activities,
are not in any obvious way a part of 
conservative homeostatic systems. Yet 
many such systems display a temporal
regularity over the appropriate time 
base. It is the purpose of this paper 
to show that many repetitive response 
systems can be analyzed by the same
set of operations. Two quantitative
system parameters are derived from
the way and extent that a given system
responds to deliberately introduced
perturbations. Since numerous
response systems can be measured in
the same way, a quantitative taxonomy
becomes possible. This general approach
might be called “behavioral
statics,” the analysis of behavior before
or after a learning situation, as
contrasted with the traditional concern
of altering behavioral rates through
the introduction of contingencies.

* The author wishes to express an intellectual
debt to William S. Verplanck, who
originally conceived the fundamental ideas
this paper merely developed.

One class of response systems ulti- 
mately involves a reservoir of some 
kind. Breathing, eating, and drinking 
might be said to fill reservoirs. Urina- 
tion and defecation drain upon reser- 
voirs. The behavior associated with 
each of these systems is identifiable 
and repetitive; the animal breathes, 
eats, drinks, and defecates repeatedly. 
These different responses are also 
characterized by rates. When cumu- 
lative responses are plotted as a func- 
tion of time, the rate of the ongoing 
behavioral system appears as the slope 
of the function. If the animal is de- 
prived of the opportunity to respond 
by plugging the nostrils, withdrawing 
food or water, or closing the urethra, 
the response rate becomes zero. When
the opportunity to respond is restored,
what might be called “replacement behavior”
takes place; there is an initial
high rate of response which continuously
decreases until, at some later
time, the response rate stabilizes. This
typical course of events is illustrated
in Figure 1. The predeprivation rate,
r1, is forced to zero by experimental
procedures at Time D (for
deprived). When the opportunity to
respond is restored (at Time R), replacement
behavior occurs. The system
reaches rate stability r2, which is depicted
as being equal to r1. At the
time of restoration of the opportunity
to respond, reservoir systems would
have a theoretical deficit (need) equal
to the difference between r1, an extrapolated
(or control group) rate, and
the response function at Time R. The
deficit results from the organism’s drain
upon material in the reservoir, and
replacement behavior serves to restore
the reservoir to its normal psychological
parameters. Thus Figure 1 depicts
a reservoir system as showing
complete, or 100%, replacement.

Fig. 1. Example of a “theoretically perfect” response system. (The characteristic response rate, r1, was forced to zero at Time D. At Time R the opportunity to respond was restored. The system compensated for the deficit accumulated during deprivation, then returned to its characteristic rate.)
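
The notion of a theoretical deficit can be made concrete with a short sketch. It assumes a linear predeprivation rate and a piecewise-linear cumulative record; the rate of 10 responses per hour, the times D = 5, R = 8, and T = 10, and the function names are hypothetical values chosen only so that the system behaves like the "theoretically perfect" case. The constant replacement rate used here is a simplification of the continuously decreasing rate described in the text.

```python
def theoretical_deficit(r1, time_d, observed_cumulative, t):
    """Deficit at time t: responses expected from extrapolating the
    predeprivation rate r1 beyond Time D, minus the responses observed.

    observed_cumulative is a function mapping time to cumulative responses.
    """
    expected = observed_cumulative(time_d) + r1 * (t - time_d)
    return expected - observed_cumulative(t)


def observed(t):
    """Hypothetical cumulative record: 10 responses/hour until D = 5,
    no responding until R = 8, a raised rate of 25/hour until T = 10,
    then the original rate of 10/hour again."""
    if t <= 5:
        return 10 * t
    if t <= 8:
        return 50
    if t <= 10:
        return 50 + 25 * (t - 8)
    return 100 + 10 * (t - 10)


for t in (5, 8, 10, 12):
    print(t, theoretical_deficit(10, 5, observed, t))
```

The deficit is largest at Time R and falls to zero by Time T, after which the record again runs along the extrapolated line.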

Certain other response systems,
though repetitive and stable in rate,
serve no known reservoir function. 
Examples of these behavioral systems 
would be the manipulation behavior of 
monkeys, running wheel activity of 
rodents, and saccharine ingestion. Fig- 
ure 2 illustrates the response of the 
hypothetical nonreservoir system to the 
same deprivation procedures employed 
in the first example. At Time D, the 
animal was deprived of the opportunity 
to respond, thus forcing the response 
rate to zero. After an interval of 
deprivation, the opportunity to re- 
spond was restored at Time R. In 
this example, replacement behavior oc- 
curred but was not complete. The 
theoretical deficit at Time R was replaced
by 20%. At about Time T, r2
stabilized and became equal to the predeprivation
rate r1, as their equal
slopes indicate. The deficit, however,
was never further diminished. This 
illustration suggests that behavioral 
systems are not properly dichotomized 
into replacement and nonreplacement 
(or drive and nondrive) systems, but 
that replacement is a quantifiable con- 
tinuous variable. 

Those behavioral systems without
known reservoirs and which allow survival
following infinite deprivation intervals
are those which have the most
controversial drive status. One typical
experimental procedure employed
to test for drivelike effects of an unknown
system allows a deprived experimental
group and an undeprived
control group to respond again at
Time R of Figure 2. At Time T, or
after some arbitrary interval, the responses
of the two groups are totaled,
and it is often found that the experimental
group responded more than
the undeprived group. This classical
“test for drive” can discover but one
of two nominal classifications for response
systems: drive and nondrive.
As Figure 2 shows, a system showing
any appreciable amount of partial replacement
might be classified as a
“drive” system without discriminating
between 100% replacement systems
and low replacement systems. The
status of presently controversial behavioral
systems might be clarified by
procedures which measure the amount
of replacement behavior on a continuous
scale with more than two categories.

Fig. 2. Example of a repetitive response system which, following deprivation, partially compensates for the theoretical accumulated deficit.

Certain replacement behaviors are
more than complete. Hoarding (e.g.,
Morgan, 1947) following deprivation
serves as an example, and certain
complex human responses are sometimes
said to be overcompensatory
following deprivation. Overcomplete
replacement behavior is illustrated in
Figure 3, which shows 160% replacement
following deprivation. In this
hypothetical example, r1 was the daily
rate of food-transport to the home
cage prior to deprivation. After the
opportunity to respond was restored
(Time R), more food was transported
to the home cage than would have
been (r1) had there been no deprivation.

Fig. 3. Example of a repetitive response system which, following deprivation, overcompensates for the theoretical accumulated deficit.

Fig. 4. Example of a bimodal reversible response system. (When the opportunity to respond was restored at Time R, the system did not return to its “addicted” rate r1 but assumed the preaddiction rate r2. The system’s two operating modes are r1 and r2. The system is reversible because it can be unaddicted.)

The several hypothetical examples 
already discussed were assumed to re- 
turn to the predeprivation rate after 
the system was experimentally dis- 
turbed. Deprivation, however, might 
alter the rate parameter of certain 
repetitive response systems such as ad- 
dicted behavior. Figure 4 shows a 
hypothetical addiction system (e.g., 
smoking) operating at rate r1. Following
deprivation, r2 is zero. It can
be safely assumed that some previous
rate was zero, but, following loading
procedures, became stabilized at r1.
Since deprivation restored the func- 
tional properties of the behavioral sys- 
tem to a former value (not smoking), 
the system could be termed reversible. 
A general taxonomic device such as 
is here proposed should allow for the 
general case, and hence for unexpected 
results. Thus there might be a class 
of response systems whose rate parameter
r2 can be experimentally “set”
at various values intermediate between 
abstinence and some upper limit by the 
operations of deprivation and loading. 
A change in the rate parameter of a
response system can be expressed as

$$\Delta r = 100 \left[ \frac{\arctan(r_2 / r_1) - 45^\circ}{45^\circ} \right]$$

in which Δr is the change in rate, r1
is the predeprivation or preloading
rate, and r2 the postdeprivation or
postloading rate. The effect of the
several constants is to create a scale
ranging from 100 to −100, with zero
representing no change in rate, or r2 =
r1.
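
A minimal sketch of the rate-change scale follows. It assumes the scale has the form 100 × (arctan(r2/r1) − 45°)/45°, a form that satisfies the stated properties (a range from −100 to 100, with zero at r2 = r1); the function name and the numerical rates are hypothetical.

```python
import math


def delta_r(r1, r2):
    """Change in rate on a scale from -100 to 100.

    Assumes the form 100 * (arctan(r2 / r1) - 45 deg) / 45 deg, which gives
    0 when r2 == r1, -100 when r2 == 0, and approaches 100 as r2 grows.
    """
    if r1 <= 0:
        raise ValueError("r1 must be positive")
    angle = math.degrees(math.atan(r2 / r1))
    return 100.0 * (angle - 45.0) / 45.0


print(delta_r(20, 20))   # rate unchanged
print(delta_r(20, 0))    # responding ceases after deprivation
print(delta_r(20, 40))   # rate doubled after loading
```

Because only the ratio of the two rates enters the expression, the value does not depend on the response units or time units of a particular experiment.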

The other system parameter, percentage
of replacement C (for compensation),
can be clarified by reference
to Figure 2. The theoretical
deficit at Time R is the vertical difference
between r1 and the response
function. At some Time T, r2 has
stabilized and the deficit has been replaced
by 20%. C is thus defined as

$$C = 100 \, \frac{d_R - d_T}{d_R}$$

with d_T equal to the theoretical deficit
at Time T and d_R the deficit at Time R.
C can range from zero to ∞, with C =
100 representing complete replacement
of the theoretical deficit, as depicted
in Figure 1.

The two parameters C and Δr
suffice to describe the response to ex- 
perimentally introduced perturbations 
of behavioral systems as diverse as 
eating, drinking, manipulation, locomotion,
hoarding, self-injected drugs, etc.
These parameters are quantitative, their
relevant operations are specified, and 
they are independent of the type of 
response, response units, and time 
units of a given experiment. The four 
hypothetical examples can be adequately
described by their C, Δr values.
The ideal reservoir system of Figure 1
is a 100, 0 system; the partial replacement
system of Figure 2 is a 20,
0 system; and the next two figures
show 160, 0 and 0, −100, respectively.
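
The percentage of replacement for the first three hypothetical systems can be checked with a short sketch. It assumes C is 100 times the fraction of the deficit at Time R that has been made up by Time T; the deficit values (30 responses at Time R, and 0, 24, or −18 remaining at Time T) are hypothetical numbers chosen to reproduce the percentages quoted above, and a negative remaining deficit stands for overcompensation.

```python
def compensation(deficit_at_r, deficit_at_t):
    """Percentage of the theoretical deficit at Time R that has been
    replaced by Time T. Values above 100 indicate overcompensation."""
    if deficit_at_r <= 0:
        raise ValueError("there must be a positive deficit at Time R")
    return 100.0 * (deficit_at_r - deficit_at_t) / deficit_at_r


# Hypothetical deficits, in responses, for the systems of Figures 1 to 3.
examples = {
    "Figure 1 (complete replacement)": compensation(30, 0),
    "Figure 2 (partial replacement)": compensation(30, 24),
    "Figure 3 (overcompensation)": compensation(30, -18),
}

for name, c in examples.items():
    print(f"{name}: C = {c:g}")
```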

A graphic summary of response systems
within a parameter space is possible.
With Δr on the abscissa and
C on the ordinate, the two parameters
of a given system locate it as a point 
in the two-dimensional space. For 
example, reservoir systems would 
probably cluster together in the central 
region of 100, 0. Other clusters might 
occur as natural groupings in other 
locations, and it would be possible that 
systems within a cluster showed simi- 
lar motivational effects. 

The illustrations discussed have as- 
sumed both stability and linearity of 
the response rate. The assumption of 
stability rules out behaviors which 
fluctuate capriciously in rate from one 
time unit to the next in a constant en- 
vironment. It is likely, however, that 
many response systems operate in a 
stable and linear way when extraneous 
variables are fixed at static levels and 
when behavior is sampled over an ap- 
propriate time base. For example, the 
gnawing behavior of rats (which wears 
off the constantly growing teeth) could 
appear haphazard over short time pe- 
riods but reveal order over long time 
samples. 

The assumption of linearity is a
first approximation. If a system is
stable but nonlinear, extrapolations for 
the special case could be made. In 
these cases, a control group is especially
suited for determining r1. It
might be noted, however, that any 
constantly accelerating rate function 
means that the behavior in question 
would soon either vanish or, for in- 
creasing rates, come to occupy all the 
organism’s time. Thus it is reason- 
able to expect that most systems would 
demonstrate but small departures from 
linearity, possibly with an order of 
magnitude small with respect to ex- 
perimental error. 

In the examples discussed no spe- 
cific deprivation interval was assumed. 
There are at least two major con- 
siderations in this regard. The first 
is that the interval should be ap- 
propriate to the normal timebase of 
the system. For example, animals 
cannot be air-deprived for as long a 
time as they may be water-deprived. 
This consideration is probably en- 
compassed within the second, which 
stems from the evidence that replacement
behavior is not always a linear
function of time. For example, Stellar 
and Hill (1952) measured water re- 
placement behavior following various 
deprivation intervals. Their data indicate
approximate linearity of replacement
up to about 48 hours of
deprivation. It seems quite probable
that many response systems, as deprivation
continues, react by altering
internal system parameters. To the
extent that these parameters are
altered, the system is no longer the
“same” system as entered deprivation.
Inanition, stomach shrinkage,
greater or lesser activity, or sensory
adaptation could change the system
parameters underlying r1. Therefore,
the behavior of control groups or extrapolations
of r1 would not be legitimate
data for inferring accumulated deficits.


Until more is known about the non- 
linear properties of response systems 
during exaggerated deprivation, the 
experimental operations suggested in
this paper would probably be best con- 
fined to that portion of the depriva- 
tion interval during which replace- 
ment behavior is a linear function of 
time. 


A POSSIBLE TAXONOMY OF REPETITIVE
RESPONSE SYSTEMS


The two parameters C (percentage 
of replacement behavior) and Δr
(change of response rate) were sug- 
gested as adequate for locating a given 
response system in two-dimensional 
space. Such a space is shown in Fig- 
ure 5, and located in it are the figure 
numbers of the four hypothetical sys- 
tems used as illustrations. The loca- 
tion of numerous other systems could 
have been guessed at by the writer, 
but the reader may do this for himself 
without publishing his errors. As an 
example of locating an unknown sys- 
tem, one might imagine that, follow- 
ing deprivation of bathing, a human 
would show some slight replacement 
behavior followed by the same bathing 
rate exhibited prior to deprivation. 
This response system would then be 
located in Figure 5 in the region of 
perhaps 10, 0. The system also bears 
extension beyond the animal kingdom; 
for example, the number of cubic feet 
of water flowing past a point in a 
stream bed could be defined as “re- 
sponses.” A temporary dam is built 
to create a deprivation interval, then 
opened to allow “the opportunity to 
respond.” Simple streams, without 
alternate overflow paths, would doubt- 
lessly prove to be 100, 0 systems. 

In addition to locating response sys- 
tems in C, Ar space, various nominal 
classifications can be anticipated. 
Rate-stable, or unimodal, systems 
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systems in the two-dimensional space de- 
scribed in the text. (The located numbers 
refer to figures illustrating the system.) 


would be those capable of assuming 
but one rate under standardized con- 
ditions. Breathing serves as an ex- 
ample. Bimodal systems would be 
those capable of operating in two 
modes, such as a system subject to 
addiction. Polymodal systems would 
be those capable of being “set” at 
various rates by experimental pro- 
cedures. All polymodal systems would 
occupy more than one location in C, 
Ar space, and different behaviors 
would correspond to the system loca- 
tions. For example, an opiate might 
be expected to be a reinforcer for an 
addicted animal but not for an unad- 
dicted one. Reversible polymodal sys- 
tems would be those which could be 
returned to Mode 1 after being set in 
Mode 2, as in breaking a habit by 
deprivation. Symmetrical systems 
would be those yielding equal C, Δr 
values following both deprivation and 
loading. As an example of possible 
taxonomy, the water-ingestion system 
might be a unimodal irreversible sym- 
metrical 100, 0 response system. In 
this particular case, “irreversible” is 
redundant, for bimodality is a neces- 
sary condition for reversibility. 


LOCKARD 


Some response systems are obviously 
surrogate. For example, a rat deprived 
of responding to drinking tube 
A will show little or no replacement 
behavior if drinking tube B remains 
available during the deprivation interval. 
Response systems A and B are 
totally surrogate when tubes A and B 
both deliver water. System B 
could be made partially surrogate 
for A by adulteration, for example, 
with quinine sulphate. Whenever a 
response system shows partial replacement, 
surrogate systems should be 
sought. For example, home cage 
activity may be partially surrogate for 
activity-wheel locomotion, and an animal 
deprived of the latter might show 
an increase in the former which, [...] 


[...] is known of the surrogate properties 
of various response systems although 
much behavioral methodology is based 
upon the ability of organisms to 
substitute one response system for another. 
[...] 


opportunity to respond for food [...] 
System A must substitute System B 
(e.g., bar pressing) to maintain r, here 
defined as caloric intake. But since 
psychological interest has focused on 
systems in transition, as opposed to the 
properties of steady-state systems, 
response substitution phenomena have 
not usually been viewed as such. Had 
they been, certain past findings might 
have been viewed more thoroughly in 
this framework. For example, visual 
exploration responses of monkeys 
(Butler, 1953) could have stimulated 
investigation of possible home cage 
systems for which door opening 
substituted, to what degree they were 
surrogate, and so forth. A taxonomy 
of response systems, based upon analysis 
of their modal, symmetrical, and 
surrogate properties, might do much 
to clarify the problem of one versus 
many drives (e.g., Brown, 1961) and 
would seem relevant to issues regarding 
displacement behavior. 

The method of analysis proposed 
herein is not a theory of behavior. 
It suggests a set of operations for the 
quantification and subsequent classifi- 
cation of response systems. It is sug- 
gested, however, that systems found 
functionally similar will also be found 
similar in new behavioral situations, 
especially those to which concepts of 
motivation apply. At present, there 
are two seemingly antithetical be- 
havioral models clearly applicable to 
postdeprivation behavior: the homeo- 
static model discussed by Young 
(1951) and the adaptation-level model 
of Helson (1959). Following depriva- 
tion of an unknown Response System 
X, the homeostatic model predicts 
high replacement behavior and no 
change in response rate. The adapta- 
tion-level model predicts low replace- 
ment behavior and r₂ < r₁. Another 
likely outcome, low replacement be- 
havior and no change in rate (e.g. 
Figure 2), is not a prediction of any 
current, well-known behavioral model. 
These seemingly contradictory predic- 
tions of different behavioral models 
are all cases of the proposed C, Ar 
system which differ not categorically, 
but merely in continuous parameters. 
The proposed system thus has the ad- 
vantage of subsuming within one 
framework several seemingly unrelated 
formulations with different origins. 
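
The point that the competing models differ only in continuous parameters can be illustrated with a small sketch. It is not part of the proposed method. It assumes that C is expressed as a percentage from 0 to 100, that Δr is computed as r₂ − r₁, and that a cutoff of 50 separates "high" from "low" replacement. The function name, the cutoff, and the tolerance are all hypothetical.

```python
def describe_system(c, delta_r, high_c=50.0, tol=1e-9):
    """Name the behavioral model (if any) that predicts a point in C, Δr space.

    c        -- amount of replacement behavior, assumed to be a percentage (0-100)
    delta_r  -- change of response rate, assumed to be r2 - r1
    high_c   -- assumed cutoff between "low" and "high" replacement
    tol      -- assumed tolerance for treating the rate as unchanged
    """
    rate_unchanged = abs(delta_r) <= tol
    if c >= high_c and rate_unchanged:
        return "homeostatic"
    if c < high_c and delta_r < -tol:
        return "adaptation-level"
    if c < high_c and rate_unchanged:
        return "low replacement, no rate change (no current model)"
    return "unclassified"


# The stream example (100, 0) and the bathing example (10, 0) from the text,
# plus a made-up point with a lowered response rate.
print(describe_system(100, 0))
print(describe_system(10, 0))
print(describe_system(10, -5))
```

All three labels come from one function of two numbers, which is the sense in which the models are cases of a single C, Δr system.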


A COUPLED SYSTEM FOR CODING AND LEARNING 
IN SHAPE DISCRIMINATION 


A system for coding visual information about shape is described. 
It is shown how the output from this coding system can be coupled 
to a classifier, or recognizer, in such a way that shape equivalence is 
preserved. A major weakness of previous coding-type theories of 
shape recognition is remedied, in that a memory system is built into 
the recognizer, which becomes self-organizing in terms of stimulus 
regularities in the environment. This is shown to be a logical basis 
for perceptual learning and perceptual constancy. Recent confirmatory 
neurophysiological findings from direct recordings in the striate 
cortex of cat are briefly considered, and the possibility of coupling the 
recognition system to a response system is outlined. 


INTRODUCTION 


It has been argued elsewhere (Dodwell, 
1961c) that the orthodox probabilistic 
approach to constructing 
theories of discrimination learning is 
weak, because of the ambiguity with 
which the term “stimulus” is used in 
such theories. An ambiguous term in 
an abstract theory may have functions 
similar to those of a deus ex machina, 
and should be regarded with suspicion. 
It was also argued that a more precise 
definition of the term was needed, and 
that one way of defining stimulus 
would be in terms of a system for coding 
or classifying sensory input. Indeed, 
for some types of visual discrimination, 
such as shape recognition, 
there is now considerable behavioral 
evidence to indicate the general principles 
on which such classifying systems 
may work in fairly simple visual 
systems (e.g., Deutsch, 1960; Dodwell, 
1957, 1960, 1961a, 1961b; Sutherland, 
1957, 1959, 1960, 1963). This evidence 
has recently been supplemented 
by neurophysiological findings obtained 
by direct recording from individual 
neurons in the visual cortex of 
cat (Hubel & Wiesel, 1959, 1962). 

The concept of shape coding, as 
originally suggested by Deutsch 
(1955), can be held to be unsatisfactory 
because it does not allow for the 
operation of perceptual learning. This 
criticism can be leveled at a shape 
coding theory as a theory of shape 
recognition, that is, as a complete 
theory about how shapes are recog- 
nized, but there is no reason why the 
output from a coding system should 
not be coupled to a learning system or 
“response probability generator” to 
yield a more complete and logically 
satisfactory theory. In fact it is the 
purpose of this paper to show that it 
can be done, and to show that the 
coupled system is not open to the sorts 
of criticism which can be made of 
either component (i.e., the shape cod- 
ing system and the probabilistic dis- 
crimination learning model) on its 
own. 

The system to be described consists 
of an elaborated version of a shape 
analyzing system proposed earlier to 
explain shape discrimination in rats 
(Dodwell, 1957), connected to a sig- 
nal classification system suggested by 
Uttley (1954). This dual system 
does not directly predict response 
probabilities in the way stochastic 
models do, but a straightforward ex- 
tension of the double system can do 
so. This combined model gives a 
good fit to most of the behavioral 
data on shape discrimination in the 
rat and octopus, and also incorporates 
many of the features discovered in the 
direct neurophysiological exploration 
of the striate cortex of cat. 


THE SHAPE CODING SYSTEM 


The system envisaged consists of a 
two-dimensional array of units ar- 
ranged in parallel chains, so that each 
unit has two neighbors, except for the 
end units of each chain, each one of 
which has only one neighbor. The 
chains are functionally independent 
except in two important respects: one 
set of end units (Set A) is so ar- 
ranged that if one of them fires, they 
all fire simultaneously, and each one of 
the other set of end units (Set B) is 
connected to a single transmission line, 
called the final common cable (fcc). 
Each chain has the same number of 
units (this is not a necessary condition 
of the working of the system, but its 
introduction simplifies the exposition). 
Each unit is connected to a receptor, 
or group of receptors, on the retina, 
and when the receptor (or group of 
receptors) is stimulated this activates 
the unit to which it is connected. Such 
a unit is “active”; when a unit is not 
so activated it is “passive.” An ac- 
tive unit does not affect any of its 
neighbors by transmitting “activation” 
to them. It is assumed that only ex- 
citation from regions of sharp change 
in stimulation (contours) activates the 
array, and that adjacent receptors (or 
groups of receptors) activate adja- 
cent units. Thus, when the image of 
a sphere is focused on the retina, a 
set of units corresponding to a circular 
contour is activated on the array. 
“Corresponding” is used advisedly 
since, although a one-to-one corre- 
spondence between receptors (or 
groups of them) and units is postu- 
lated, this does not imply any simple 
geometrical similarity in the mapping 
of the points in the two regions. 
However, an important assumption is 
made about horizontal straight lines: 
it is assumed that (with the retina 
in normal orientation) stimulation by 
a horizontal line will activate units all 
on the same chain. 

A second process in the units, called 
excitation, originates in the firing of 
the End-set A, and results in a 
“sweep” across the array, since each 
unit can pass excitation from itself to 
its neighbor in the direction of the 
End-set B. All units of this set dis- 
charge into the fcc. This excitation 
is not to be confused with activation 
of units by receptors, or groups of 
receptors, to which they are con- 
nected. The transmission of excita- 
tion from unit to unit involves a 
small delay, the length of which is 
determined by whether the transmit- 
ting unit is active or passive. Only 
the two states are envisaged, and the 
delay for each is constant, say Ta and 
Tp. For convenience let Ta > Tp. 

Each unit can pass excitation to one 
neighbor only, and excitation always 
travels from A towards B. The time 
for excitation to traverse a chain of 
n + 1 units, all of which are passive, 
is n (Tp). If the units of the array 
are passive, the traverse times will be 
equal, and there will be a single large 
discharge into the fcc, since all the A 
units fire together and excitation 
reaches all the B units simultaneously. 

When some units are active, some 
passive, the traverse time for any particular 
chain will depend only on the 
number of active and passive units on 
it, and can be written in the form 
r(Ta) + (n − r)(Tp). The added delay 
involved when r units are active 
on a chain is obtained by subtracting 
n(Tp) from this expression: 


r(Ta) + (n − r)(Tp) − n(Tp) = r(Ta − Tp) 


Let the additional time for transmission 
through an active unit be ΔT. Thus 
ΔT = Ta − Tp, and the added delay 
on a chain with r active units is rΔT. 
This may be called the “delay func- 
tion” for a single chain. The delay 
function for an array of N chains will 
be made up of a set of terms of this 
form. Effectively, the system trans- 
forms spatial information into a func- 
tion of a single variable, t. Thus, the 
characteristics of the delay function 
are independent of which particular 
chains, or which units on those chains, 
are activated. That is, the signal in 
the fcc is independent of the signaled 
shape’s position (but not its orienta- 
tion) on the array. 
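
The position independence claimed above can be checked with a minimal sketch. It assumes the array is represented as a list of rows (one row per chain) holding 1 for an active unit and 0 for a passive one, and that ΔT is taken as the unit of time. The function names `fcc_signal` and `translate` are illustrative and not part of the original model.

```python
from collections import Counter


def fcc_signal(array, delta_t=1.0):
    """Return {added delay r*ΔT: number of chains with that delay}.

    Each row of `array` is one chain; 1 marks an active unit, 0 a passive one.
    """
    return dict(Counter(sum(chain) * delta_t for chain in array))


def translate(array, d_rows, d_cols):
    """Move every active unit by (d_rows, d_cols); the shape must stay on the array."""
    height, width = len(array), len(array[0])
    moved = [[0] * width for _ in range(height)]
    for i, chain in enumerate(array):
        for j, unit in enumerate(chain):
            if unit:
                moved[i + d_rows][j + d_cols] = 1
    return moved


# Contour of a 3 x 3 square in the top-left corner of a 6 x 8 array.
square = [
    [1, 1, 1, 0, 0, 0, 0, 0],
    [1, 0, 1, 0, 0, 0, 0, 0],
    [1, 1, 1, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
]

original = fcc_signal(square)
shifted = fcc_signal(translate(square, 2, 3))
print(original)
print(original == shifted)
```

Because only the count of active units per chain enters the delay, moving the contour to other chains or other units leaves the signal unchanged.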

Sets may be formed for chains hav- 
ing equal values of r. One of these 
will be the set for which r = 0, which 
may be written [q0, nTp], where q0 
represents the number of chains in the 
set, and hence the amplitude of the initial 
volley in the fcc. The total signal 
for one “sweep” will be the union, or 
logical sum, of these sets. The set 
for any one value r = rx contains two 
components, qx and rxΔT. The former 
is a measure of the amplitude of the 
pulse generated in the fcc, the latter 
a measure of the time at which the 
pulse is generated, measured from the 
initial pulse (when r = 0) as base line. 
The delay function for the array is 
sufficient for deducing the important 
characteristics of a signal if, as would 
often be the case, the number of 
empty chains and the size of the initial 
volley are unimportant. In general it 
is necessary that there should be either 
at least one large set with equal r, or 
at least one chain with a value of r 
that is atypical, for a distinctive signal 
to be generated (either qx or rx, or 
both, identify a set). 

It has already been stated that the 
array is assumed to be so arranged 
that chains represent, functionally, one 
of the principal axes of the visual 
field; stimulation of receptors by a 
horizontal line will activate units all 
on one chain. Which chain this is 
will be immaterial as far as the signal 
in the fcc is concerned; the delay func- 
tion is simply [1, rΔT], and the dis- 
tinctiveness of the signal is due to this 
one atypical value of r (for all other 
chains, r=0). Two horizontal lines 
would be distinguished from one by a 
doubling of the amplitude of the de- 
layed discharge, and so on. 
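
The horizontal-line case can be worked through in a short sketch. It assumes the same row-per-chain representation of the array and measures delays in units of ΔT. The helper names and the 6 x 8 array size are hypothetical.

```python
from collections import Counter


def delayed_volleys(array, delta_t=1.0):
    """Return {r*ΔT: q} for chains with r > 0 (the initial volley is ignored)."""
    counts = Counter(sum(chain) for chain in array)
    return {r * delta_t: q for r, q in counts.items() if r > 0}


def blank(height, width):
    """An array in which every unit is passive."""
    return [[0] * width for _ in range(height)]


def horizontal_line(array, row, start, length):
    """Activate `length` adjacent units on one chain and return the array."""
    for col in range(start, start + length):
        array[row][col] = 1
    return array


one_line = horizontal_line(blank(6, 8), row=1, start=2, length=4)
two_lines = horizontal_line(horizontal_line(blank(6, 8), 1, 2, 4), 4, 0, 4)

# The same four-unit line turned through 90 degrees: one active unit on each of four chains.
vertical = blank(6, 8)
for row in range(1, 5):
    vertical[row][3] = 1

print(delayed_volleys(one_line))
print(delayed_volleys(two_lines))
print(delayed_volleys(vertical))
```

One line gives a single delayed volley of amplitude 1 at 4ΔT, two lines double that amplitude, and the rotated line gives a quite different signal (amplitude 4 at ΔT), which is the orientation sensitivity described in the next paragraph.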

In general, the system will yield 
distinctive signals for different shapes, 
and is sensitive to changes in orienta- 
tion (rotation in the frontal plane). 
Simple geometric considerations show 
that this sensitivity is greatest for 
small rotations of a horizontal straight 
line (provided the density of units on 
the array is uniform) whether or not 
this is part of a shape, such as the 
base of a square or triangle. The 
same considerations show that it is 
least sensitive to rotations near the 
vertical. However, as it stands the 
system will not yield distinctive signals 
for some pairs of shapes which one 
would expect to be readily discrimi- 
nable, such as a diamond and a pair 
of vertical lines. This difficulty can 
be overcome by postulating two sys- 
tems, functionally perpendicular to 
each other, so that in the second a 
vertical line in the visual field activates 
units all on one chain. Such a dual 
system will explain fairly satisfactorily 
most of the data on shape discrimination 
in the rat and in the octopus. 
For instance, it will explain the fact 
that both species can readily discriminate 
horizontal from vertical straight 
lines, but have difficulty with mutually 
perpendicular lines set at 45° to the 
horizontal; the fact that mirror images 
tend to be confused, that transfer 
tends to be easy (for the rat) when 
the transfer shape shares a horizontal 
contour with the training shape and 
that rotation of shapes in the sagittal 
plane tends to upset discrimination. 
Detailed evaluations of the empirical 
evidence for this, and for other shape 
coding systems, have been published 
elsewhere (Dodwell, 1957, 1961b) and 
will not be repeated here. While it is 
true that some facts about shape recog- 
nition are not compatible with the 
theory, it should be remembered that 
other means of discrimination, such 
as by brightness distribution, may en- 
ter as contaminating factors (see 
Dodwell, 1961b). The important fea- 
ture to be brought out at the moment 
is that the primary information proc- 
essing postulated in the coding sys- 
tem is the first step in the specification 
of a logically satisfactory theory of 
shape recognition. The features of 
the system that are important in this 
respect are, first of all, the fact 
in it spatial information is transformed 
into a temporal analogue, and secondly, 
that this analogue consists of se- 
quences of pulses of different ampli- 
tudes and latencies which all occur in 
a single transmission line. It is not 
claimed that this is the only suitable 
way of coding information about shape 
for further processing, but it has two 
major advantages. 
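
The diamond and vertical-pair example mentioned earlier in this section can be illustrated as follows. This is a sketch under stated assumptions: the array is a 7 x 7 grid, delays are in units of ΔT, and the second, functionally perpendicular system is modelled simply by transposing the array. None of these choices comes from the original model.

```python
from collections import Counter


def delayed_volleys(array):
    """{r: q} in units of ΔT for chains (rows) with r > 0 active units."""
    counts = Counter(sum(chain) for chain in array)
    return {r: q for r, q in counts.items() if r > 0}


def dual_signal(array):
    """Signals from the horizontal-chain system and from a perpendicular one.

    The second system is modelled by transposing the array, so that a
    vertical line falls on a single chain.
    """
    transposed = [list(column) for column in zip(*array)]
    return delayed_volleys(array), delayed_volleys(transposed)


SIZE = 7
# Diamond contour centred on the array.
diamond = [[1 if abs(i - 3) + abs(j - 3) == 3 else 0 for j in range(SIZE)]
           for i in range(SIZE)]
# Two vertical lines, five units long.
vertical_pair = [[1 if j in (2, 4) and 1 <= i <= 5 else 0 for j in range(SIZE)]
                 for i in range(SIZE)]

diamond_h, diamond_v = dual_signal(diamond)
pair_h, pair_v = dual_signal(vertical_pair)
print(diamond_h, pair_h)
print(diamond_v, pair_v)
```

In the first system both shapes produce a volley of amplitude 5 at 2ΔT and differ only in the two apex chains of the diamond. In the perpendicular system the vertical pair produces two chains with a delay of 5ΔT, which the diamond never does.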

The first advantage is that the in- 
formation about shape is coded inde- 
pendently of the position on the array 
(and hence on the retina) at which 
the shape was present. This means 
that the system overcomes the prob- 
lem of identification of a shape in dif- 
ferent locations—which has always 
been a major difficulty for neural 
theories of shape discrimination—in a 
simple fashion. The second advantage 
is that it makes possible an attempt 
to solve the problem of stimulus 
equivalence in general (recognition of 
shape independently of position is, of 
course, also part of the problem of 
stimulus equivalence, but is a rather 
special case). Explicit separation of 
the operations of “coding” and “rec- 
ognizing” allows one to reformulate 
the problem of stimulus equivalence 
into the question: which outputs from 
the coding system are classified in the 
same way by the “recognizer”? To 
answer this question one must con- 
sider how the recognizer operates on 
the outputs from the coding system. 


THE RECOGNIZING SYSTEM 


How can “recognition” be defined in 
a formal system of the type here en- 
visaged, which is to be equated, in 
some sense, with neural processes in 
the brain? One may define recogni- 
tion as the registration of some con- 
stant pattern of activity, or activity at 
some constant locus, for a given input 
pattern or class of input patterns. 
Such a system will now be described. 

The recognizer accepts and classifies 
the output from the coder, which in 
the present system consists of se- 
quences of pulses of varying frequency 
and amplitude, all occurring in a single 
transmission line (or in two lines for 
the double system; the single system 
will be described, since the problem is 
in principle identical, but more com- 
plicated, for the double system). A 
system has been outlined by Uttley 
(1954) which, with slight modifica- 
tions, will classify inputs of this type. 
As it stands, Uttley’s system classifies 
temporal patterns of discrete pulses 
of equal amplitude, and is thus a 
digital system. The outputs of the 
coding system described above are 
analogues of the geometrical proper- 
ties of shapes; however, Uttley 
(1954) indicates a simple analogue-digital 
conversion system, of a type 
which might be used in a computer, 
which could occur in a nervous system 
(p. 486). There is evidence that analogue-digital 
conversion does occur 
in certain neural structures (e.g., 
Hilali & Whitfield, 1953), so it is not 
implausible to suggest that some such 
conversion occurs between the outputs 
of the coder and the recognizer. The 
particular form of analogue-digital 
conversion suggested requires that the 
recognizer have a number of inputs, 
and is as follows: each volley in the 
coder output gives rise to a “unit” impulse 
in one of the inputs to the recognizer. 
If l different amplitudes are to 
be discriminated, there must be l inputs 
in the recognizer: each input fires 
for one and only one value of amplitude 
qx, or for only a small range 
of such values. For the simplest case, 
where only two different amplitudes 
are recognized, two inputs to the recognizer 
are required, inputs j and k, 
say. These two inputs are connected 
to counters which register either (a) 
the initial arrival of unit impulses in 
either input, or (b) the temporal patterns 
of impulses on either or on both 
inputs. This is achieved by feeding 
each input into a series of delay units, 
and then to the counters, as shown in 
Figure 1. 


PETER C. DODWELL 


FIG. 1. The recognizer for outputs from a shape coding system. 
(For explanation, see text.) 


(The recognizer can be 
imagined as either a nerve network or 
as an electrical system; however, its 
properties are specified independently 
of any particular embodiment.) Ex- 
cept for the first two counters (j and 
k), a counter will only register when 
two impulses arrive at it simultane- 
ously or within an arbitrary small 
interval of time. To see how this 
occurs, consider the counter labeled 
jj′ in Figure 1. It will register if, 
and only if, an impulse in the j input 
is followed by a second impulse in the 
same input after time τ, where τ is 
the delay induced by the first delay 
line on input j. Similarly, the adjacent 
counter jk′ registers only when 
an impulse on j is followed by an impulse 
on k after time τ. (The primes 
refer to the number of delay units 
separating a pair of impulses.) Assuming 
τ = hΔT, where h is an integer, 
the time intervals required to activate 
different counters will be simply related 
to the output from the coder. 
It can be seen that the recognizer of 
Figure 1 will discriminate all possible 
patterns for the two inputs j and k, 
up to time delays of 3τ. For patterns 
involving more than two impulses, 
more than one counter (excluding the 
initial j and k counters) will register. 
A simple higher order type of recognition 
can easily be added, whereby 
a counter jj′kk″, for example, will only 
register when both jj′ and jk″ are 
registering. jj′ and jk″ fire simultaneously 
if, and only if, an impulse 
in [...] is followed by one in j after time 
2τ, followed by a second impulse in j 
after a further time τ. In this case 
the counter jk″ would also fire, of 
course, so that at a third level of classification 
a counter jj′jk″jk″ would 
register. This hierarchical system of 
classification can be continued to any 
desired level of complexity, according 
to the number of inputs, and the number 
and complexity of patterns to be 
classified. The only restriction required 
is that uτ < nΔT where u is the 
number of delay units in the recognizer, 
and n, as before, is the number 
of units per chain in the array. 
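
The coincidence counters described above can be sketched in a few lines. This is an illustration under stated assumptions, not Uttley's design: time is measured in whole delay units τ, there are two input lines named "j" and "k", and a label such as `jk'` follows the definition given for jk′ (an impulse on j followed by one on k, one prime per delay unit). The function name and the data layout are hypothetical.

```python
from itertools import product


def registered_counters(impulses, max_delay=3):
    """Return {counter label: sorted firing times} for a two-input recognizer.

    impulses  -- iterable of (time, line) pairs; time is an integer number of
                 delay units, line is "j" or "k"
    A label such as "jk''" means: an impulse on j followed by one on k two
    delay units later (one prime per delay unit).
    """
    impulses = set(impulses)
    fired = {}
    for time, line in impulses:
        # First-order counters j and k register every impulse on their line.
        fired.setdefault(line, []).append(time)
    for (t1, first), (t2, second) in product(impulses, repeat=2):
        gap = t2 - t1
        if 1 <= gap <= max_delay:
            label = first + second + "'" * gap
            # A coincidence counter fires when the later impulse arrives.
            fired.setdefault(label, []).append(t2)
    return {label: sorted(times) for label, times in fired.items()}


pattern = [(0, "j"), (1, "k"), (3, "j")]
later = [(t + 5, line) for t, line in pattern]

counters = registered_counters(pattern)
print(sorted(counters))
print(sorted(counters) == sorted(registered_counters(later)))
```

The same set of counters registers whenever the pattern occurs, whatever its starting time, and two of them (`jj'''` and `kj''`) fire at the same moment, which is the condition a higher-order counter would detect.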
“Equivalent stimuli” can now be 
defined as inputs from the receptors 
which give rise to the activation of 
the same set of counters in the recog- 
nizer. It may be thought that little 
is gained by introducing a recognizer 
of this form—that in fact the original 
information about shape has been lost 
almost beyond recovery, and that no 
compensating advantage has been se- 
cured. However, it should be noted, 
in the first place, that original signal 
information is contained unambigu- 
ously in the system of registers that 
are activated: if the system were re- 
versed, so that the counters became 
signal generators, the original input 
signal would be perfectly reproduced 
for a given constellation of registers, 
but now as an output from the system 
(Uttley, 1954). Secondly, a distinct 
advantage has been secured in the 
sense that a structure is set up by 
which a constant local pattern of activ- 
ity is a necessary consequence of a 
spatial pattern, at the receptors, which 
is not location specific. As Ayer 
(1957) points out, the occurrence of a 
necessary consequence is in all respects 
equivalent to a sufficient condition for 
the antecedent’s occurrence; i.e., the 
occurrence of a given pattern at the 
recognizer is sufficient to ensure that 
a particular pattern did activate the 
receptors. The only exception here is 
the fact that certain different patterns 
may give rise to identical or nearly 
identical patterns from the coder, in 
which case perceptual confusion arises. 
The question of whether or not “per- 
ceptual confusion” is to be equated 
with “perceptual equivalence” is taken 
up in the next section of the paper. 
Before taking up this question, it may 
be noted that an information-reducing 
system such as the coder, interposed 
between the receptors and the recog- 
nizer, serves to remove an objection 
that has been raised against the Uttley- 
type recognizer. This is the objection 
that, to be effective, such a classifica- 
tory system would need an impossibly 
large number of units to deal with all 
combinations of receptor activation 
patterns (Uhr, 1963). 


PERCEPTUAL LEARNING 


The coupled system so far described 
has no “memory”; its performance is 
entirely independent of its own previ- 
ous states. As such, therefore, it is 
deficient as a model for perception. 
To show how the system can be modi- 
fied to incorporate a type of perceptual 
learning, the notion of a conditional 
probability computer can be intro- 
duced. The idea is, again, due to 
Uttley (1956, 1958, 1959). Consider 
the hierarchical classification system of 
Figure 2. The lines a, b, and c repre- 
sent inputs, the open circles represent, 
as in Figure 1, coincidence counters. 
Simultaneous inputs at, for instance, a 
and b will activate the three counters 
a, b, and ab. Simultaneous inputs at 
a, b, and c will activate all the counters. 


[Figure 2: ... classification. (For explanation, see text.)] 


Suppose now that each counter 
has a limited memory, such that each 
time it counts, its state is altered step- 
wise, but gradually tends to revert to 
its initial condition. (A simple physi- 
cal analogue is a leaky condenser, to 
which a standard charge is applied, 
each time the unit counts.) Suppose, 
further, that in this system two count- 
ers in approximately the same state 
(with equal charges) will count if 
either one of them is activated. For 
example, if b has almost always oc- 
curred at the same time as a, then if 
a is activated (but not b) the b and ab 
counters will also be activated. By 
choosing a suitable criterion for “same 
state” such a system computes condi- 
tional probabilities at a specified level. 
(The probability that b occurs, given 
a, determines whether or not the b 
and ab counters register when a occurs 
alone.) The details of these computers, 
and the related conditional 
certainty computers, are given by 
Uttley (1956, 1958, 1959) and will not 
be repeated here, except to note that 
by building in suitable short- and long-term 
decay characteristics, rather good 
analogues of the main features of [...] 


[...] conjunctions of patterns, and conjunctions 
of patterns occurring in rapid 
succession. Such conjunctions of patterns 
must lead to “generalization” or 
“inference” by the machine, and this 
is all that is needed to complete the 
type of perceptual learning required 
for the coupled system of coder and 
recognizer. For, in its normal environment, 
an organism is in constant 
visual contact with stimuli which 
fluctuate in size, shape, locus of stimulation 
of the retina, and so on. How 
does it come to “organize” or “recognize” 
a stable visual world? The 
usual answers are either that it learns 
to perceive the regularities among 
stimuli and conjunctions and sequences 
of stimuli, or that such perceptions 
are built in, both answers 
which nicely beg the question at issue 
when couched in such general terms. 
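
The "leaky condenser" idea introduced above can be illustrated with a deliberately simplified sketch. It is not Uttley's conditional probability computer: the decay rule, the unit charge, the relative tolerance used for "same state", and the class and method names are all assumptions made for the example.

```python
from itertools import combinations


class LeakyClassifier:
    """Coincidence counters with a decaying charge, loosely after the text."""

    def __init__(self, inputs, leak=0.05, tolerance=0.1):
        self.leak = leak              # fraction of charge lost per time step (assumed)
        self.tolerance = tolerance    # relative difference still counted as "same state" (assumed)
        # One counter for every non-empty combination of inputs: a, b, ab, ...
        self.charge = {
            frozenset(combo): 0.0
            for size in range(1, len(inputs) + 1)
            for combo in combinations(inputs, size)
        }

    def directly_activated(self, active):
        """Counters whose inputs are all present in `active`."""
        return [c for c in self.charge if c <= frozenset(active)]

    def present(self, active):
        """One time step: every charge leaks, then the activated counters are topped up."""
        for counter in self.charge:
            self.charge[counter] *= 1.0 - self.leak
        for counter in self.directly_activated(active):
            self.charge[counter] += 1.0

    def registers(self, active):
        """Counters that count for this input, including those inferred by equal state."""
        direct = self.directly_activated(active)
        result = set(direct)
        for counter, charge in self.charge.items():
            for source in direct:
                reference = self.charge[source]
                if reference > 0 and abs(charge - reference) <= self.tolerance * reference:
                    result.add(counter)
        return {"".join(sorted(c)) for c in result}


# b has always accompanied a: presenting a alone also brings in b and ab.
always = LeakyClassifier("ab")
for _ in range(40):
    always.present("ab")
print(sorted(always.registers("a")))

# b has accompanied a only on alternate occasions: no inference is drawn.
sometimes = LeakyClassifier("ab")
for step in range(40):
    sometimes.present("ab" if step % 2 == 0 else "a")
print(sorted(sometimes.registers("a")))
```

The tolerance plays the part of the "suitable criterion for same state": tightening or loosening it changes the level of conditional probability at which the b and ab counters register when a occurs alone.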

The coupled system offers a solution 
which is not so flagrantly question 
begging. If one imagines a particular 
object in the early environment of an 
organism, say a circular disk, then as 
the organism approaches the object, 
moves around, over or away from it, 
the size, shape, orientation, etc. of 
the object will change, but these 
changes will be orderly, and will give 
rise to orderly changes in the output 
from the coder, and hence to orderly 
sequences of counts in the units of the 
recognizer. Many of these sequences 
will occur in fairly rapid succession 
(as when the organism locomotes) 
and, eventually, will give rise to “inferential 
learning” when the conditional 
probabilities become high 
enough, and stable enough. (In the 
interests of reasonable conciseness 
these conditions have not been enumer- 
ated; Uttley (1956, 1958) discusses 
them.) That is, when one member of 
the sequence is counted by the recog- 
nizer, if that counter’s state is suffi- 
ciently close to the states of some 
other counters in the system, these will 
also register. The recognizer will 
therefore build up a system of internal 
classification which is dependent on 
the organism's history, to the extent 
that regularities in the perceived en- 
vironment will be matched by regular 
patterns of activity in the recognizer. 
Thus, one can envisage two types of 
stimulus equivalence, one due to the 
ways in which the coder processes in- 
coming patterns, which gives rise to 
“low order equivalence” because out- 
puts from the coder are identical or 
very similar, the other due to experi- 
ence of regular sequences of signals 
from the coder, which builds up higher 
orders of equivalence within the recog- 
nizer. Thus, for example, in this 
system the equivalence of a pair of 
circular contours is explainable in 
terms of the nearly identical outputs 
they generate in the coder. On the 
other hand, the “equivalence” of dif- 
ferent perspective views of a circular 
disk, which may generate quite differ- 
ent outputs from the coder, is explain- 
able in terms of “perceptual learning,” 
which means, in this context, the regu- 
lar conjunction of sequences of dif- 
ferent signals. Such higher-order 
equivalence can even be thought of— 
rather speculatively—as the basis for 
perceptual constancy. It may be noted 
that this system offers an explicit and 
quite precise solution to the problems 
which Hebb’s cell assembly theory was 
designed to meet. Hebb’s explanation 
of “superordinate activity” as a basis 
for equivalence is vague, and in fact 
circular (Hebb, 1949, p. 98). The 
present system incorporates both the 
feature of low-level “preset” recogni- 
tion of simple visual elements, and the 
feature of learned equivalence on the 
basis of repeated contiguous stimula- 
tion, but explains these as conse- 
quences of the initial coding system 
and the self-organizing properties of 
the recognizer, respectively. The first 
of these two deals satisfactorily with 
an awkward feature of Hebb's theory, 
namely that of explaining just how 
a primitive feature—a line, or angle in 
his argument—is initially “equivalent” 
for different retinal stimulus loca- 
tions, and the second allows us to 
give an unambiguous definition of 
what Hebb calls “superordinate activ- 
ity,” that is to say, the constant physio- 
logical event that mediates perceptual 
recognition and generalization. The 
gain in precision leads to a more 
powerful theory of perceptual learn- 
ing, which means of course one that 
is more amenable to experimental 
verification. 

The testable consequences of this 
theory stem mainly from the proper- 
ties of the coding system, and these 
predictions have been made, and some 
of them tested, in other places (e.g., 
Dodwell, 1957, 1960, 1961a). How- 
ever, the combined system does allow 
one to make a specific statement about 
perceptual equivalence, perceptual con- 
fusion, and the limits of discriminabil- 
ity. By the definition of the system, 
any pair of similar outputs from the 
coder are “equivalent,” at the lowest 
level, for the recognizer. In view of 
the inferential character of the recog- 
nizer’s operation, it is clear that, after 
some perceptual learning has occurred, 
a pair of similar outputs from the 
coder will activate a set of higher- 
order counters in the recognizer. If 
the pair of outputs actually are conse- 
quents of the reception of two differ- 
ent shapes, then perceptual confusion 
has occurred. However, any one 
shape (or object) will, in general, be 
capable of generating a number of 
different outputs from the coder, as 
explained above, and it is the set of 
these possible outputs that defines 
final, learned, recognition. Only in a 
case where all the possible outputs 
generated by a pair of shapes are the 
same would one have to admit the 
logical necessity of equivalence by con- 
fusion. Otherwise, successive dis- 
crimination of differences can occur in 
the system with prolonged visual in- 
spection and search, as the two sets 
of units corresponding to the two 
shapes are organized in the recognizer. 
It should now be clear, too, that the 
system can handle contextual effects. 
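
The distinction drawn above between perceptual confusion and necessary equivalence may be illustrated by a short sketch. It assumes, purely for illustration, that each shape is represented by the set of coder outputs it can generate, and that "similar" outputs are simply identical labels. The shape names and output labels are hypothetical and form no part of the model as published.

```python
# Illustrative sketch only: each shape is represented by the set of
# coder outputs it can generate. Labels such as "o1" are hypothetical.

def can_be_confused(outputs_a, outputs_b):
    """True if at least one coder output is shared, so that a single
    presentation of either shape could yield the same output."""
    return bool(set(outputs_a) & set(outputs_b))


def necessarily_equivalent(outputs_a, outputs_b):
    """True only if every possible output is shared, the one case in
    which no amount of inspection could separate the two shapes."""
    return set(outputs_a) == set(outputs_b)


def discriminating_outputs(outputs_a, outputs_b):
    """Outputs generated by exactly one of the two shapes; these are
    what prolonged inspection and search could exploit."""
    return set(outputs_a) ^ set(outputs_b)


square = {"o1", "o2", "o3"}   # hypothetical output sets
diamond = {"o3", "o4"}

print(can_be_confused(square, diamond))
print(necessarily_equivalent(square, diamond))
print(sorted(discriminating_outputs(square, diamond)))
```

On this reading, confusion requires only an overlap between the two sets, whereas equivalence by logical necessity requires that the sets coincide.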


SOME NEUROPHYSIOLOGICAL 
EVIDENCE 


The model presented in the previous 
three sections was devised and in part 
published some time ago (Dodwell 
1957, 1958), but at that time there 
was behavioral evidence bearing only 
on the form of the coding system 
in any direct way. The recognition 
system was elaborated to complete the 
model for perception in a logically 
satisfactory way, as indicated in the 
Introduction above. It did no violence 
to the facts, yet there was no solid 
evidence in its favor—it was specula- 
tive. However, quite recently some 
neurophysiological findings which bear 
directly on this system have been re- 
ported, findings which confirm quite 
strikingly that the main features of 
the proposed coding and recognition 
system are correct, although modifica- 
tion in detail may eventually be re- 
quired. 

Hubel and Wiesel (1959, 1960, 
1962) in a number of publications 
have reported on their method of di- 
rect recording from individual neurons 
in the striate cortex of cat. Their 
principal early finding (1959) was 
that many of these neurons have 
defined retinal receptive fields, which 
have a specific orientation, in contrast 
to retinal receptive fields at the lateral 
geniculate body, which tend to be 
circular. Moreover, such cortical cells 
characteristically are maximally sensitive 
to a stimulus which consists of a 
[...] line of the receptive field, and in 
[...] orientation. Small changes in the 
orientation of the line relative to the 
retina are accompanied by rapid 
decreases in sensitivity, and small 
changes of position on the retina 
[...] pattern of the cortical neuron, 
[...] typically the receptive field consists of 
a narrow excitatory band, surrounded 
by a wider inhibitory field, or vice 
versa. It is contour rather than surface 
that activates these neurons; 
blanketing a receptive field with a surface 
leaves the neuron inactive. The 
so-called “simple” fields can be mapped 
out with small spots of light, to determine 
their excitatory and inhibitory areas, 
and knowledge of these areas is sufficient 
to interpret the response of the 
neuron to more complex stimuli, such 
as a rectilinear contour, in terms of 
summation of excitation and inhibition 
from different parts of the field. 
Clearly the simple receptive field 
represents a stage of processing sensory 
input which is very similar to 
the stage postulated for the coder described 
above. The latter’s principal 
feature is that contours generate specific 
signals according to their orientation 
and position on the two-dimensional 
array. It will be recalled that 
these signals are taken into a function 
of a single variable, f, in the final common 
cable, in such a way that information 
about contour orientation is 
preserved, but positional information 
is not. Contours thus become equivalent, 
so long as they are in the same 
orientation, and this is represented in 
the model by the fact that such equiva- 
lent contours all activate the same unit 
in the recognizer. Interestingly 
enough, Hubel and Wiesel (1962) 
have identified cortical neurons hav- 
ing exactly this property also; namely, 
that they are activated only by recti- 
linear contours in specific orientations, 
but the position of that contour on 
the retina can vary considerably, in 
some cases over a field subtending an 
angle of up to 32° at the eye. This 
clearly represents a further stage in 
the coding process, beyond the simple 
fields. To be specific: 


The cell . . . failed to respond to round 
spots of light, whether small or large. By 
trial and error with many shapes of stimulus 
it was discovered that the cell's firing could 
be influenced by a horizontally oriented slit 
1/8° wide and 3° long. Provided the slit 
was horizontal its exact positioning within 
the 3°-diameter field was not critical. When 
it was shone anywhere above the centre of 
the receptive field . . . an ‘off’ response was 
obtained; ‘on’ responses were evoked 
throughout the lower half. . . . one might 
have expected wider slits to give increas- 
ingly better responses owing to summation 
within the upper or lower part of the field, 
and that illumination of either half by itself 
might be the most effective stimulus of all. 
The result was just the opposite: responses 
fell off rapidly as the stimulus was widened 
beyond about 1/8°, and large rectangles cover- 
ing the entire lower or upper halves of the 
receptive field were quite ineffective. . . . 
On the other hand, summation could easily 
be demonstrated in a horizontal direction, 
since a slit 1/8° wide but extending only 
across part of the field was less effective 
than a longer one covering the entire width. 
One might also have expected the orienta- 
tion of the slit to be unimportant as long 
as the stimulus was wholly confined to the 
region above the horizontal line or the 
region below. On the contrary, the orienta- 
tion was critical, since a tilt of even a few 
degrees from the horizontal markedly re- 
duced the response, even though the slit did 
not cross the boundary separating the upper 
and lower halves of the field . . . for this 
cell the strict requirements for shape and 
orientation of the stimulus were in marked 
contrast to the relatively large leeway of 
the stimulus in its ordinate position on the 
retina [Hubel & Wiesel, 1962, pp. 114-115]. 


At the same time it must be stated 
that these authors have found quite a 
variety of receptive fields, although 
the two types described appear to be 
the main ones. They find no evidence 
for a preponderance of neurons with 
receptive fields in horizontal and ver- 
tical orientations. No evidence on this 
point is available for the rat, but there 
is anatomical evidence of dominance 
in the octopus (Young, 1960), and 
there is probably a genuine species dif- 
ference between the cat and rat in this 
respect, since Sutherland (1963) has 
recently shown that cats have no diffi- 
culty in discriminating between mu- 
tually perpendicular contours set at 
45° to the horizontal, in contrast to 
the marked difficulty of this discrimi- 
nation for the rat and for the octopus. 


THE RESPONSE SYSTEM 


Nothing has been said so far about 
responses, or the way classification 
in the system is related to response 
probability. There are two different 
problems to be faced. The first is: 
How are responses related to stimu- 
lation during the acquisition of hier- 
archical classifications in the recog- 
nizer? The second is: How are 
responses related to stimulation for an 
already formed (stable) classification 
system or subsystem? It would go 
beyond the scope of this paper to try 
to deal with these questions in detail, 
but a brief outline of a possible system 
might be as follows: 

At some level of the classification 
system “response generators” are con- 
nected to the counters in the recog- 
nizer. These response generators have 
the property that, if activated, they 
produce a particular response or class 
of responses. Initially, all counters at 
the level in question have an equal 
probability of activating any one of 
the response generators (only one of 
which can be activated at a given 
time). The probability of a particular 
response generator’s being activated by 
a particular counter on subsequent 
trials is raised or lowered according to 
the history of successes or failures of 
that response to the stimulus in ques- 
tion; there is ample scope here for 
deployment of the operators of sto- 
chastic models for learning. The 
relation of response generators to 
counters would be a one-many rela- 
tion, both within and between different 
levels of the classification system. In 
this way discrimination learning would 
be possible at differing levels of com- 
plexity and abstraction. Early learn- 
ing would occur principally at the 
lowest levels of the classification sys- 
tem (responses would only be made 
consistently to simple aspects of the 
environment), whereas adult learning 
could be in terms of more general, 
complex or abstract features. While 
this component of the system is indi- 
cated only in the briefest way, it at 
least shows that the initial suggestion 
made in this paper, of hooking to- 
gether a coding and perceptual learn- 
ing system with a response generating 
system, is quite feasible, and should 
lead to a more realistic characteriza- 
tion of discrimination learning proc- 
esses than has previously been pos- 
sible. Finally, it may be noted that 
perceptual learning in the proposed 
system is by contiguity alone, but that 
reinforcement of instrumental re- 
sponses would certainly affect the 
contingencies through which perceptual 
learning and organization occur. 
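
The outline of the response system given above may be made more concrete by a brief sketch. The text specifies only that each counter begins with equal probabilities over the response generators and that these probabilities rise or fall with success or failure; the linear-operator update used here, the class name `Counter`, the `rate` parameter, and the response labels are all assumptions introduced for illustration, being one of many stochastic learning operators that could be deployed.

```python
import random


class Counter:
    """One recognizer counter linked to several response generators."""

    def __init__(self, responses):
        # Initially every response generator is equally likely.
        n = len(responses)
        self.probs = {r: 1.0 / n for r in responses}

    def choose(self, rng=random):
        """Activate exactly one response generator on this trial."""
        responses = list(self.probs)
        weights = [self.probs[r] for r in responses]
        return rng.choices(responses, weights=weights, k=1)[0]

    def update(self, response, success, rate=0.2):
        """Linear-operator update (an assumed rule, not the paper's)."""
        p = self.probs[response]
        new_p = p + rate * (1.0 - p) if success else p * (1.0 - rate)
        # Rescale the other probabilities so the total remains 1.
        rest = 1.0 - p
        others = [r for r in self.probs if r != response]
        for r in others:
            share = self.probs[r] / rest if rest > 0 else 1.0 / len(others)
            self.probs[r] = share * (1.0 - new_p)
        self.probs[response] = new_p


counter = Counter(["approach", "avoid"])   # hypothetical responses
counter.update("approach", success=True)
print(counter.probs)
```

A single counter is shown; the one-many relation described in the text would correspond to several such counters, at one or more levels, sharing the same response generators.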


SUMMARY 


The study of shape discrimination 
and discrimination learning has pro- 
gressed from two different, but not 
contradictory, points of view. The- 
ories of shape discrimination have 
concentrated on the problem of coding 
or analyzing the properties of shape 
as a basis for explaining the behavior 
of organisms with simple visual sys- 
tems in discriminatory training and 
transfer situations. This approach 
has neglected the importance of per- 
ceptual learning. Theories of dis- 
crimination learning, couched in proba- 
bilistic terms, have neglected problems 
of stimulus classification and stimulus 
equivalence. A theory of shape classi- 
fication (coding) is described, to- 
gether with a system for recognition 
of the outputs of the coder, which is 
self-organizing in terms of regularities 
in the perceived environment. That 
is to say, the system incorporates the 
principle of perceptual learning. A 
possible method of connecting this 
coupled system to a response generat- 
ing system is also briefly outlined. 
Taken together, these three systems— 
coder, recognizer, and response gen- 
erator—constitute a logically satisfac- 
tory model for shape discrimination 
and discrimination learning. 
THEORETICAL NOTE 
FURTHER ANALYSIS OF GSR CONDITIONING: 


A REPLY TO STEWART, STERN, WINOKUR, AND FREDMAN 
H. D. KIMMEL 1 


University of Florida 


A reply to Stewart et al. (1961), regarding the “true” conditioned 
GSR. In addition to arguing that their “true” CR may be merely a 
part of a larger response which has gradually become inhibited, a 
definition of conditioning is suggested which permits responses to be 
classed as CRs even when they are elicited by the CS prior to train- 
ing. Data are presented to show that the latency of the initial GSR 
to the CS does change during training, contrary to their claim that 
it does not. After many conditioning trials, the initial GSR gradually 
is transformed into a highly attenuated and delayed CR. The major 
difference between this point of view and that of Stewart et al. is 
that we define the “phenomenon” of conditioning by means of experi- 
mental control procedures rather than insisting that a new response 
to the CS occur. 


In a recent article in this journal, 
Stewart, Stern, Winokur, and Fredman 
(1961) have questioned the validity of a 
considerable body of research on GSR 
conditioning on the grounds that adequate 
criteria for the definition of “true” CRs 
were neither specified nor met by pre- 
vious workers in this field. Their argu- 
ment was based principally upon accept- 
ance of a stimulus-substitution definition 
of classical conditioning and upon the data 
of their experiment (N = 19). Accord- 
ing to these writers, a true conditioned 
GSR is a response to the conditioned 
stimulus (CS) which is not elicited by 
the CS before it has been paired with the 
unconditioned stimulus (UCS). Since 
almost any suprathreshold stimulus ini- 
tially elicits a GSR prior to conditioning, 
they propose to distinguish between these 
original responses to the CS and true CRs 
by means of a latency criterion. Para- 
phrased briefly, this criterion is that the 
true CR has a latency outside the range 
of latency of the UCRs to the CS. 

Applying the criterion to the data col- 
lected in their experiment, in which a 
7.5-second CS-UCS interval was used, 


1 Some of the research described was done 
under NIH Grant M-6060. 


they report that responses of this sort 
are reliably discriminable from the origi- 
nal response to the CS, that their latency 
range is greater than that of the original 
responses to the CS, and that the response 
frequency curves of the two types of re- 
sponse during training are different. 
They concluded, 


In summary, we have put forward reasons 
for thinking that so far work on GSR con- 
ditioning has dealt with the adaptation and 
recovery of unconditioned responses rather 
than conditioning of responses [p. 66]. 


The present paper questions this con- 
clusion on two grounds. It will be shown 
first that their true CR may be, in fact, 
a segment of what they refer to as the 
UCR to the CS and, second, that their 
approach to defining CRs may be the 
result of arbitrary reasoning about the 
conditioned GSR and classical condition- 
ing in general. 

Actually, Stewart et al. touched on the 
possibility that the deflection which they 
have called the true CR is part of the 
initial response to the CS. In the dis- 
cussion of their Figure 2, containing a 
sample record, they note, 


The sample showing an example of three 
responses occurring at one trial illustrates 
Fig. 1. Average magnitude patterns of the GSR during an 8-second CS-UCS interval before 
conditioning and at five stages of conditioning (N = 8 at each point). 


the fact that on occasion the conditioned re- 
sponse was part of a complex of deflections 
rather than being discrete [p. 64, italics 
added]. 


From the present point of view, double 
and other types of complex responses re- 
sult from gradual changes in the pattern 
of the CR during conditioning, changes 
which parallel the phenomenon known as 
“inhibition of delay” in Pavlovian termi- 
nology. Inhibition of delay in GSR con- 
ditioning operates to depress the earlier 
portions of the initial CR (as well as to 
delay its onset) and frequently may cre- 
ate the impression of a “double response.” 
Eventually (after many conditioning 
trials), only a very reduced and delayed 
response occurs, if there is any response 
at all. The present approach attempts to 
comprehend this change by observing 
trial-by-trial changes in the latency and 
pattern of the GSR to the CS. 

In a recent experiment (Kimmel, 1963) 
a 1000-cps tone was used as the CS and 
an electric shock as the UCS. Eight sub- 
jects came to the laboratory at the same 
hour on 9 successive days and were given 


eight paired presentations of the CS and 
UCS on each day. The CS had a dura- 
tion of 8 seconds and was followed im- 
mediately by a 1-second shock. The 
amount of difference between the basal 
resistance (pretrial) and the resistance 
at the peak of the first deflection of the 
recorder pen was measured at each 1- 
second point during the CS-UCS interval. 
This was done for each subject sepa- 
rately on each trial. These values were 
transformed into units of log conductance 
change and averaged for selected pairs of 
adjacent trials (three pairs of trials on 
the first day and the last two trials on 
the second, fourth, and eighth days). 
These data are shown in Figure 1. 
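
The scoring procedure just described may be sketched as follows. The text does not state whether the measure is the logarithm of the conductance change or the change in log conductance; the sketch assumes the latter, and the function names and the choice of base-10 logarithms are likewise assumptions made for illustration.

```python
import math


def log_conductance_change(basal_ohms, peak_ohms):
    """Log10 conductance at the deflection minus log10 basal conductance.

    Conductance is the reciprocal of resistance, so the result equals
    log10(basal_ohms / peak_ohms) and is positive when resistance falls.
    """
    if basal_ohms <= 0 or peak_ohms <= 0:
        raise ValueError("resistances must be positive")
    return math.log10(1.0 / peak_ohms) - math.log10(1.0 / basal_ohms)


def mean_pattern(per_subject_values):
    """Average the second-by-second values across subjects.

    per_subject_values: list of equal-length lists, one per subject,
    each holding one transformed value per 1-second point.
    """
    n = len(per_subject_values)
    return [sum(column) / n for column in zip(*per_subject_values)]
```

Applying `log_conductance_change` at each 1-second point and passing the resulting lists for the eight subjects to `mean_pattern` would yield one averaged curve of the kind plotted in each panel of Figure 1.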
Figure 1 shows that early in training 
the GSR reaches its peak magnitude after 
very few seconds and remains there dur- 
ing the entire CS-UCS interval. This 
is the reaction that Stewart et al. con- 
sider to be a sensitized original reaction 
to the CS. As training progresses, the 
strength of the early portion of this 
initial reaction declines, its latency in- 
creases, and its peak shifts toward the 
Fig. 2. Average latency of initial GSR 
deflection following the CS on each of 50 
conditioning trials. (N = 50 at each point; 
a latency of 9 seconds was used on trials on 
which no response occurred.) 


time of delivery of the UCS. Attention 
is drawn to the possibility that the sec- 
ond “deflection” noticeable in the bottom 
three panels of Figure 1 may not consti- 
tute a unique “second” reaction to the 
CS but, rather, may be what remains of 
the total reaction shown in the third panel 
(following 7-8 reinforcements), after the 
early and middle portions of the response 
have begun to be attenuated. Treating 
the early and late portions of the reaction 
as if they were different reactions, their 
frequency curves over trials would show 
the relationship reported by Stewart et al. 
(and in this sense replicate their find- 
ings). 

A more elaborate study of the same 
phenomenon 2 indicates in greater detail 
exactly how continuous this shift in the 
latency of both the onset and the peak of 
the response to the CS actually is. In 
this study 50 subjects were conditioned 
at a 7.5-second CS-UCS interval, using 
a circular red light as the CS (produced 
by a Grayson-Stadler multiple stimulus 
projector) and an electric shock as the 
UCS. The shock had a duration of .1 


2 W. A. Greene, Ellen Kimmel, J. H. 
Kramer, and R. C. Martin collected the data 
in this experiment. Mrs. Kimmel’s partici- 
pation was supported by her National Insti- 
tutes of Health predoctoral research fellow- 
ship PHS-MH-16798-02 (6S1A57). 
second and occurred at the end of the CS. 
Figure 2 shows the average latency of 
the first response to the CS on each of 
the 50 conditioning trials. An arbitrary 
value of 9.0 seconds was used in all cases 
of no response (on the assumption of a 
minimum latency of 1.5 seconds for the 
UCR). The use of the 9-second value 
was necessitated by the obvious fact that 
as latencies get longer and longer they 
eventually become “infinite,” in that no 
CR at all occurs. Omission of scores on 
trials on which no response occurred 
would produce an analytical problem of 
unequal Ns and would offer too con- 
servative an indication of the actual trend 
in the data. Nevertheless, our examina- 
tion of the same data as are shown in 
Figure 2, with the zero-response trials 
omitted, revealed a gradually increasing 
trend of somewhat lower slope than shown 
in the figure. 
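
The two ways of treating trials without a response may be compared in a short sketch. The function name and the sample latencies are hypothetical and are not data from the experiment; only the 9.0-second substitute value is taken from the text.

```python
def mean_latency(latencies, no_response_value=None):
    """Mean latency for one trial across subjects.

    latencies: list holding a latency in seconds, or None where the
    subject gave no response on that trial.
    no_response_value: value substituted for None; if left as None,
    no-response cases are omitted from the mean instead.
    """
    if no_response_value is None:
        used = [x for x in latencies if x is not None]
    else:
        used = [no_response_value if x is None else x for x in latencies]
    if not used:
        return None
    return sum(used) / len(used)


trial = [2.0, 3.0, None, 4.0]   # hypothetical latencies, not real data
print(mean_latency(trial, no_response_value=9.0))  # substitution: 4.5
print(mean_latency(trial))                          # omission: 3.0
```

Substitution keeps N constant from trial to trial, while omission yields the lower and more conservative estimate, in keeping with the lower slope reported above for the omitted-trials analysis.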

As is indicated in Figure 2, the latency 
of the initial response to the CS moves 
upward in a continuous fashion from the 
first conditioning trials to the last. The 
straight line drawn through the data 
points was fitted by the method of least 
squares to focus attention upon the essentially 
linear nature of the trend. The 
slight drop in latency in the first few 
trials (e.g., Trial 5) may reflect an 
early sensitization effect, although sheer 
speculation is involved here. Considering 
that each point in the figure represents 
the mean of 50 subjects, no statistical 
test appeared to be necessary to support 
the conclusion that a gradual increase 
in GSR latency occurred. 

Fig. 3. Average latency of maximum 
GSR deflection following the CS on each of 
16 blocks of three conditioning trials, omitting 
Trials 1 and 50. (Latencies for zero-response 
trials were omitted and Ns reduced 
appropriately.) 

Fig. 4. Actual GSR record of a selected subject, who received 50 trials of CS-UCS with 
a 7.5-second interstimulus interval. 
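
The straight line fitted to the latencies of Figure 2 by the method of least squares may be computed as in the sketch below. The function name is an assumption, and no values from the figure are reproduced; the example input in the usage note is arbitrary.

```python
def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept for y = a + b*x."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need two or more paired observations")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and of cross-products with y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("x values must not all be equal")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept
```

With trial numbers as `xs` and mean latencies as `ys`, a positive slope corresponds to the gradual increase in latency described above; for example, `fit_line([1, 2, 3, 4], [3, 5, 7, 9])` returns a slope of 2 and an intercept of 1.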

Also of interest in this experiment was 
the way in which the time of occurrence 
of the maximum deflection shifted toward 
the end of the CS-UCS interval. These 
data are shown in Figure 3, omitting all 
trials on which no response occurred. 
The data in Figure 3 have been grouped 
into blocks of three trials to increase their 
stability (to mitigate the effect of 
small and unequal Ns resulting from omit- 
ting trials of zero response). It is clear 
in Figure 3 that the increase in the aver- 
age latency of the maximum GSR deflec- 
tion approximates the form of a negative 
growth function rather than a straight 
line. This is to be expected for data of 
this type, since the latency of the maxi- 
mum deflection, or “recruitment latency,” 
as it is sometimes called (Prokasy, Hall, 
& Fawcett, 1962), reflects an excitatory 
as well as an inhibitory tendency (Dos- 
sett & Grings, 1959) (since the larger 
a response is, the later will be its maxi- 
mum value). 
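
The grouping used for Figure 3 may be sketched as follows. Trials 2 through 49 form 16 blocks of three, and trials without a response simply contribute no scores. Whether the original analysis pooled individual scores within a block or averaged the trial means is not stated; pooling is assumed here, and the function name and data layout are hypothetical.

```python
def block_means(latencies_by_trial, first=2, last=49, size=3):
    """Mean peak latency per block of consecutive trials.

    latencies_by_trial: dict mapping trial number to a list of peak
    latencies (seconds) from the subjects who responded on that trial.
    Returns a list of (block_mean_or_None, n_scores) tuples.
    """
    blocks = []
    for start in range(first, last + 1, size):
        pooled = []
        for trial in range(start, min(start + size, last + 1)):
            # Trials with no responding subjects contribute nothing.
            pooled.extend(latencies_by_trial.get(trial, []))
        mean = sum(pooled) / len(pooled) if pooled else None
        blocks.append((mean, len(pooled)))
    return blocks
```

Returning the number of scores alongside each mean makes the small and unequal Ns visible, which is the reason given in the text for blocking the trials.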

On the basis of the foregoing data it 
would appear to be at least as 
reasonable to assert that the “second” 
deflection described by Stewart et al. is 
actually part of the dynamically changing 
total CR made to the CS as it would to 
argue that it is a unique reaction in its 
own right. As they have observed, two, 
three, four, and even more apparently 
separated deflections may occur following 
the CS during a long CS-UCS interval. 
In the present view, parsimony is sacri- 
ficed and controversy kindled by assuming 
that the second deflection is not only 
unique but the only true CR. Inspection 
of a selected actual GSR record, while 
not a proper basis for sweeping generali- 
zations, serves to illustrate the difficulty 
that attends any effort to isolate the sec- 
ond deflection. Such an example is 
shown in Figure 4. 

Figure 4 shows the actual trial-by- 
trial changes (over 50 CS-UCS trials) 
in the pattern of the GSR during the 
CS-UCS interval for a single subject. 
The basal resistance at the beginning of 
each trial is indicated (in ohms) at the 
top of each panel and the amplitude scale 
used throughout appears in the lower left 
of the first panel. The first heavy curved 
line shows the time of onset of the CS 
and the second heavy curved line shows 
the time of CS offset and UCS onset. 
The CS-UCS interval was 7.5 seconds. 

It can be seen in Figure 4 that either 
a second deflection or a depression in the 
total deflection appeared on the third 
trial. On the fourth trial the second 
deflection is less discrete but larger, while 
on the fifth trial it has gotten smaller 
again and is followed by a third deflec- 
tion. When we get to Trials 8 and 9 the 
second deflection has all but vanished. 
However, it is again noticeable on Trial 
11. Trials 14 and 15 show a curious 
“stepladder” phenomenon, one which is 
not uncommon in our data. Finally, by 
Trial 19, both deflections are gone com- 
pletely, although two very delayed CRs 
appeared on Trials 42 and 43. In the 
present view this record illustrates the 
gradualness and variability of CR changes 
during conditioning. It is because of this 
gradualness and variability that careful 
study of the trial-by-trial changes in the 
GSR pattern is required if we are to 
arrive at an understanding of the long- 
interval delayed conditioning process. 

Stewart et al. (1961) are quite correct 
in observing, regarding the GSR, that: 


one finds that almost no stimuli are neutral, 
and hence that the reflex action proper to the 
conditioned stimulus is the same as that 
proper to the unconditioned stimulus, though 
varying in extent [p. 60]. 


What can be done, then, to eliminate the 
possibility that what appears to be “con- 
ditioning” may, by their definition, be 
mere “pseudoconditioning”? They would 
have us seek a GSR with latency char- 
acteristics different from those of the 
UCR to the CS, on the grounds that 
latency differences have been utilized by 
workers with other responses (e.g., eye- 
blink and startle reflexes). Although it 
was shown in Figure 2 that the latency of 
the conditioned GSR does increase dur- 
ing training (suggesting that a latency 
criterion could be used—i.e., the cri- 
terion of increasing latency), their po- 
sition requires empirical support never- 
theless. No such criterion is used in 
studies of salivary conditioning. Short- 
latency reactions in salivary conditioning 
are typically taken as indicating greater 
strength of conditioning than long-latency 
reactions (Kimble, 1961). Yet, Stewart 
et al. reject short-latency reactions as 
sensitized original responses and identify 
long-latency reactions as true CRs. The 
fact is that latency criteria have proved 
most useful, for example in eyelid con- 
ditioning, not as a means for describing 
what a CR is, but to identify what is not 
a CR. Here, the work of Grant (1945) 
on the beta response and of Spence and 
Ross (1959) on voluntary responses have 
had a significant influence in providing a 
basis for elimination of these classes of 
non-CRs from conditioning data. And, 
of course, this question cannot be dropped 
without noting that what may be appropri- 
ate for short-latency muscular reactions 
may not necessarily be appropriate for 
long-latency glandular reactions. 

There is doubtless a need for differ- 
entiation among the various types of 
quasi-conditioning results and actual con- 
ditioning. One must start, obviously, 
with a definition of conditioning. Stew- 
art et al. have chosen to apply to GSR 
conditioning a definition descriptively ap- 
propriate for salivary conditioning 
(stimulus substitution). It seems prefer- 
able to us to apply the label “condition- 
ing” to any S-R alterations directly at- 
tributable to the paired presentation of 
the CS and UCS (Kimble, 1961) and 
“quasi-conditioning” to such changes not 
due to pairing. There appear to be only 
two acceptable procedures by which 
the conditioning-quasi-conditioning dis- 
tinction can be made. The first would 
involve complete duplication, in a control 
group, of all the conditions characterizing 
the conditioning procedure, except the 
pairing of the CS and UCS. All that 
would then be required would be a signifi- 
cant difference between the conditioning 
and control groups (Kimmel, 1959). This 
procedure, of course, is the prototype for 
several of the more frequently used con- 
trols described in the conditioning litera- 
ture, i.e., the “backward” control, the 
“random-unpaired” control, the “long 
interstimulus interval” control, the 
“pseudoconditioning” control, not all of 
which are equally adequate. The second 
procedure for differentiating between con- 
ditioning and quasi-conditioning would 
use differential reinforcement of at least 
two CSs. If after, say, eight paired 
presentations of a 1000-cps tone and 
an electric shock, interspersed with eight 
presentations of an 800-cps tone un- 
paired with shock, the GSR to the 
1000-cps tone is significantly greater in 
magnitude than that to the 800-cps tone 
(Kimmel & Pennypacker, 1963), can we 
not attribute this difference to condition- 
ing? According to the criterion of Stew- 
art et al. that the CR not be elicited by 
the CS before training, the answer would 
have to be “No!” Both tones produced 
unconditioned GSRs before training. 
Nevertheless, if we succeeded in sensi- 
tizing the UCR to the 1000-cps tone, 
we would expect similar sensitization of 
the UCR to the 800-cps tone. Obviously, 
more than sensitization occurred. 

The differential conditioning procedure 
serves very effectively to differentiate 
between “specific S-R sensitization,” on 
the one hand, and sensitization due either 
to a lowering of the threshold of the 
particular response system for any stimu- 
lus or to an increase in general arousal, 
on the other, but does not differentiate 
between the latter two. The difference 
between GSR magnitude to the 1000- and 
800-cps tones must have been due to 
either true conditioning or specific S-R 
sensitization. It could not have resulted 
from a general increase in GSR respon- 
siveness (i.e., reduced GSR threshold) 
nor from an increase in general arousal 
(generally reduced thresholds), otherwise 
responses to both S+ and S— would have 
been affected. On the other hand, the 
random-unpaired control group technique, 
mentioned earlier, may isolate sensitiza- 
tion of the general arousal type. Kimmel 
(1959) ran three conditioning groups, 
each receiving a different intensity of the 
CS paired with electric shock, and three 
random-unpaired control groups, each 
receiving one of the three CS intensities 
used in the conditioning groups. The 
control groups received the same number 
of CSs and UCSs as did the conditioning 
groups in an unpaired, unsystematic se- 
quence. The comparison between the 
conditioning and control groups receiving 
the weakest intensity of the CS (35 
decibels) was the only one that showed 
a significant difference in favor of the 
conditioning group. A progressive in- 
crease during the training in GSR magni- 
tude to the CS occurred in both groups 
that received the strongest intensity of 
the CS (115 decibels). This gradual 
increase in response magnitude was shown 
to be associated with a significant in- 
crease in basal skin conductance from be- 
fore to after training, in both the con- 
ditioning and control groups receiving the 
115-decibel stimulus (but not in the other 
groups), indicating that increased general 
arousal or activation underlay this in- 
stance of quasi-conditioning. It was 
surely true in this experiment that the 
relationship between CS intensity and 
conditioning was strikingly different from 
the relationship between CS intensity and 
quasi-conditioning, the former being nega- 
tive but the latter positive. 


SUMMARY 


Having attempted to show that the 
true conditioned response identified by 
Stewart et al. may actually be a part of 
the earlier response to the CS which they 
reject as a sensitized UCR, we have 
chosen to define conditioning in such a 
way as to consider as CRs those reac- 
tions to the CS that may be qualitatively 
similar to responses elicited by the CS 
prior to training, providing that S-R 
changes are shown, by means of an ef- 
fective control procedure, to be corre- 
lated with pairing of the CS and UCS. 
It may be that, for the GSR, no effective 
approach can be found to differentiating 
between alpha conditioning, or sensitiza- 
tion (due to pairing) of a UCR to the 
CS, and true conditioning, in which the 
CS does not elicit the response before 
training (although latency changes were 
described that might do this). This im- 
plication of the present discussion should 
not be distressing since differentiation of 
this sort may not be necessary either. 
Kimble (1961) has recently arrived at a 
somewhat similar position with regard 
to the difference between eyelid con- 
ditioning and pseudoconditioning or, even, 
regarding “all conditioning in which a 
noxious stimulus is employed [p. 62, 
italics in original].” 

The position taken in this paper is that 
Stewart et al. are correct in emphasizing 
the need for more adequate definition of 
the CR, but it is suggested that this be 
achieved via operational specification of 
the “phenomenon” of conditioning by 
means of effective control groups. Cer- 
tainly those studies that have employed 
adequate quasi-conditioning controls can 
be taken to be studies of conditioning 
and not, merely, studies of “adaptation 
and recovery of unconditioned responses 
[Stewart et al., p. 66].” What is re- 
grettable is that there are so few studies 
that meet even this criterion. 
PSYCHOLOGICAL REVIEW 


TWO HYPOTHESES CONCERNING THE BIO- 


CHEMICAL BASIS OF MEMORY 


THOMAS K. LANDAUER* 
Dartmouth College 


The 1st hypothesis is that the basic event of conditioning or learning
is the transfer of RNA molecules from surrounding glial cells into
conducting neurons and the subsequent transformation of the protein
synthesizing apparatus of the neuron. The transfer of RNA across
membranes (of activated neurons only) is presumed to be due to
electrophoretic migration in the reversed gradient during
impulse conduction. The 2nd hypothesis is that information is
coded for storage in the CNS in the form of frequency characteristics
of spreading ac potentials to which a neural membrane can become
tuned by alteration of its protein structure. Together, the 2
hypotheses suggest the basis of a physiological theory of learning.


Within the last few years consider- 
able evidence has accumulated impli- 
cating ribonucleic acid (RNA), one 
of the macromolecules involved in the 
storage and utilization of genetic in- 
formation, in the processes underlying 
memory storage in the nervous system. 
RNA is found in unusually great 
amounts in the grey areas of the cen- 
tral nervous system (CNS) (Mandel, 
Harth, & Borkowski, 1961). Its con- 
centration in nervous tissue increases 
and decreases with age in correspond- 
ence with age changes in learning 
ability, and supplementation of RNA 
has been alleged to retard memory 
loss with age in humans (Cameron & 
Solyom, 1961). Supplementation of 
RNA has also been reported to improve
learning and retention in rats
(Cook, Davidson, Davis, Green, &
Fellows, 1963), and chemicals which
affect RNA utilization have been
found to affect memory trace formation
in planaria, mice, and rats
(Chamberlain, Rothschild, & Gerard,
1963; Corning & John, 1961; Dingman
& Sporn, 1961; Flexner, Flexner,
& Stellar, 1963). The concentration
of RNA in neural cells has
been shown to increase as a result of
stimulation (Hydén, 1959; Morrell,
1962), and this increase has been
shown to be accompanied by a change
in base ratio when the stimulation is
part of a learning process (Hydén &
Egyhazi, 1962, 1963). Finally, Zelman,
Kabat, Jacobson, and McConnell
(1963) have reported preliminary evidence
that injection of planaria with
RNA extracted from conditioned
planaria, as compared to RNA from
untrained planaria, may facilitate
conditioning.


¹ On leave at Center for Advanced Study
in the Behavioral Sciences, Stanford, during
1963-64. I thank David S. Dennison, Joseph
D. Harris, Wolfgang Kohler, Richard H.
Rech, and William M. Smith for many
helpful comments.


However, the manner in which RNA
might enter into the learning process 
remains entirely problematical (Briggs 
& Kitto, 1962). Since RNA is inti- 
mately involved in protein synthesis, 
and thus in almost all possible struc- 
tural modifications of cells, there is 
little limit to the number of kinds of 
neural mechanisms of learning which 
might reveal themselves in changes in 
RNA. 

Suggestions to date include an un- 
specified mechanism involving direct 
coding of information on RNA mole- 
cules in terms of base sequences 
(Gaito, 1961), enzyme induction with 
respect to transmitter substances 
(Briggs & Kitto, 1962; Smith, 1962), 
changes in proteins of terminal vesicles 
(Hydén, 1961), and changes in ionic 
binding which influence firing rates 
(John, 1962). While these sugges- 
tions have much to recommend them, 
they do not by any means exhaust the 
possibilities for constructing a mecha- 
nism which accounts for the known 
facts. In the spirit of broadening the 
available spectrum of possibilities, a 
mechanism will be proposed here 
which differs in some fundamental 
ways from any suggested previously. 

The critical features of the mecha- 
nism for which we are searching are 
that it provide a means by which RNA 
of selected neurons can be altered by 
experience, and a way in which altered 
RNA can give rise to altered behavior. 
Proposal of a mechanism thus con- 
stitutes the proposal of two interlock- 
ing hypotheses, a learning hypothesis, 
and a retrieval hypothesis. The two 
hypotheses to be presented here are 
suggested by consideration of some 
of the physiological events involved 
in classical conditioning. Together 
they suggest the rudiments of a theory 
of the processes underlying at least 
some forms of learning and memory.


HYPOTHESIS I: LEARNING


The first hypothesis, the learning 
hypothesis, is suggested in part by the 
fact that classical conditioning does 
not occur if the unconditioned stimulus
(US) commences before or simultaneously
with the conditioning stimulus
(CS) (Kimble, 1961). The optimal
condition is, rather, a 0.5-second interval
between CS and US. Thus, rather
than a mechanism by which the simul- 
taneous activity of two neural elements 
results in a structural modification 
(e.g., Hebb, 1949; John, 1962), it may 
be more fruitful to look for a mechanism
by which sequential action induces
modification.

The neuronal events of the CS-US 
sequence, of course, consist of conduc- 
tion of impulses and accompanying 
transient changes in membrane poten- 
tials, ionic concentrations, and mem- 
brane permeabilities throughout the 
system of cells involved. How might 
such changes produce alteration of the 
structures of neural elements? 

A possible explanation would be 
available if impulse conduction re- 
sulted in the passage of new forms of 
RNA into neurons. If new RNA, 
which in some way represented the 
CS, were produced in the glia sur- 
rounding a neuron for a short period 
following the CS, but entered only 
neurons which then conducted im- 
pulses, new RNA would be incorpo- 
rated by just those neurons activated 
by the US. If the result of incorpo- 
rating new RNA which “represented” 
the CS (in a manner to be discussed 
below under Hypothesis II) were to 
alter the recipient cell so as to make 
it more likely to fire in the presence 
of the CS, the basic event of condition- 
ing would have occurred. 

Of course, macromolecules such as
RNA do not readily cross cell membranes,
although there are known cases
in which they do (e.g., Mannick &
Egdahl, 1962; Schwartz & Rieke,
1962). However, there are dramatic
changes in the membrane during impulse
conduction which must tend to
promote such migration. Specifically,
the potential across the membrane (of
the squid axon) goes from approximately
60 millivolts negative inside to
approximately 40 millivolts positive
inside, the positive, or overshoot, portion
of the impulse lasting about .5
millisecond (Tasaki, 1959). This represents
a change from a resting potential
gradient across the 50 Å thick
membrane (Eccles, 1959) of $1.2 \times 10^{5}$
volts per centimeter to a transient potential
gradient in the opposite direction
which averages approximately
$5.1 \times 10^{4}$ volts per centimeter (assuming
the overshoot portion is approximately
sinusoidal). That this change in forces
should strongly promote the passage
of RNA into the neuron is illustrated
by the following calculations. The
electrophoretic² mobility of yeast
RNA (Chargaff & Zamenhof, 1948)
at the pH of the brain (in a medium
of phosphate buffer containing 0.15 M
NaCl) is $-12.1 \times 10^{-5}$ cm$^2$ per volt per
second. This means that RNA in a
gradient of $5.1 \times 10^{4}$ volts per centimeter,
as present during impulse conduction,
would migrate towards the
positive pole of an electrical field at a
rate of just over 6 centimeters per
second (in the stated medium) and
over the .5 millisecond of the reversed
potential would travel approximately
.003 centimeter, or some 6,000
times the thickness of the membrane.
While there are a number of reasons
to expect less migration across membranes
in vivo, the size of the forces
involved indicates that a considerable
push toward the inside of the cell is
given to any RNA surrounding it by
the change in potential during conduction.
Moreover, if the surrounding
RNA has a “natural” tendency to
enter neural cells (as, for example,
the RNA of tobacco mosaic virus
seems to have for plant cells, Fraenkel-Conrat,
Singer, & Williams, 1957; and
is apparently the case for several
mouse tissues, Schwarz & Rieke, 1962;
and amphibian neural tubes, Ficq,
1954), then an even more important
factor would be the positive-outside
resting potential which would act to
shield the neuron against “infection”
by new RNA except during conduction.


² Electrophoresis refers to the migration
in an electric field of molecules as a result
of the “atmosphere” of small ions which
surround them. The rate of migration expressed
as centimeters per second per volt
per centimeter of the potential gradient is
known as the electrophoretic mobility. The
direction towards anode (—) or towards
cathode (+) and the rate of migration is a
function of the pH (as well as other
properties) of the medium of diffusion. For
further details, see Bull, 1951, pp. 161-191.
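
The arithmetic behind the migration estimate can be retraced in a few lines. The sketch below is only a check on the figures quoted in the text (membrane thickness, resting and overshoot potentials, overshoot duration, and the mobility of yeast RNA); the function and constant names are illustrative choices, and the use of the mean of a half sine wave for the overshoot follows the stated assumption that the overshoot is approximately sinusoidal.

```python
import math

# Figures quoted in the text (squid axon; yeast RNA in phosphate buffer).
MEMBRANE_THICKNESS_CM = 50e-8   # 50 angstroms expressed in centimeters
RESTING_POTENTIAL_V = 0.060     # about 60 millivolts, negative inside
OVERSHOOT_PEAK_V = 0.040        # about 40 millivolts, positive inside
OVERSHOOT_DURATION_S = 0.5e-3   # about .5 millisecond
RNA_MOBILITY = 12.1e-5          # magnitude, cm^2 per volt per second


def resting_gradient():
    """Resting potential gradient across the membrane, volts per cm."""
    return RESTING_POTENTIAL_V / MEMBRANE_THICKNESS_CM


def mean_overshoot_gradient():
    """Average reversed gradient, treating the overshoot as a half sine wave.

    The mean of a half sine wave is 2/pi times its peak value.
    """
    mean_potential = OVERSHOOT_PEAK_V * 2.0 / math.pi
    return mean_potential / MEMBRANE_THICKNESS_CM


def migration_velocity(gradient_v_per_cm):
    """Electrophoretic drift speed in cm per second for a given gradient."""
    return RNA_MOBILITY * gradient_v_per_cm


def migration_distance(gradient_v_per_cm, duration_s):
    """Distance in cm travelled during the reversed-potential interval."""
    return migration_velocity(gradient_v_per_cm) * duration_s


if __name__ == "__main__":
    gradient = mean_overshoot_gradient()
    distance = migration_distance(gradient, OVERSHOOT_DURATION_S)
    print("resting gradient (V/cm):", resting_gradient())
    print("mean overshoot gradient (V/cm):", gradient)
    print("drift velocity (cm/s):", migration_velocity(gradient))
    print("distance (cm):", distance)
    print("membrane thicknesses:", distance / MEMBRANE_THICKNESS_CM)
```

As the text itself cautions, this is a free-solution figure; migration across a membrane in vivo would be expected to be much smaller.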

Thus, the proposed hypothesis is as 
follows. Glial cells are supposed to 
be specialized to provide RNA which, 
if it enters a neuron, will produce an 
alteration such that the neuron will 
become more sensitive to the events 
going on in the brain at the time the 
RNA was produced. Ordinarily, a 
barrier consisting of a potential gradi- 
ent and membrane impermeability pre- 
vents the passage of this new RNA 
into neurons. However, when a 
neuron fires, this barrier is replaced 
by a potential gradient which actively 
promotes migration of RNA into the 
neuron. Thus, conducting neurons, 
and only conducting neurons, incorpo- 
rate the newly available RNA. 

If, at the time of conduction, the 
events going on in the brain were only 
those associated with the US, the 
incorporated RNA would be expected 
to be no different from that already 
present, and no change in the RNA
population of the neuron would result.
However, if a new stimulus, a
CS, had been acting very recently, new
forms of RNA would be produced and 
incorporated, and the population of 
RNA types in the conducting elements 
would be altered as a result. The new 
RNA might act on the cell in a man- 
ner analogous to that of the “trans- 
forming principle” of bacteriophage 
(Ephrussi-Taylor, 1957) (which, how- 
ever, consists of DNA rather than 
RNA) or the infectious RNA of 
tobacco mosaic virus which can pro- 
duce modification of the protein syn- 
thesizing apparatus of the cell such 
that it thereafter produces the types 
specified by the invading RNA 
(Fraenkel-Conrat et al., 1957; Williams,
1959). This picture agrees
with the findings of Hydén and co-workers
(Hydén & Egyhazi, 1962,
1963; Hydén & Pigon, 1960) that
RNA decreases in the glia and increases
in neurons during activity; that
an alteration in base ratios occurs only 
when the stimulation is part of a learn- 
ing process; that the new RNA is un- 
like that “native” to the neuron in- 
volved; and most significantly, that 
RNA changes occur in the surrounding 
glia as well as in the neurons involved 
in learning. 


HYPOTHESIS II: RETRIEVAL


The second hypothesis—the retrieval
hypothesis—proposes a solution to the
problem of how an altered population
of RNA molecules and a resulting 
transformation of the protein synthe- 
sizing apparatus of the neurons acti- 
vated by a US might make these ele- 
ments responsive to the CS. 

It should be noted that Hypothesis 
I could provide a mechanism by which 
synaptic changes, such as those postu- 
lated by Hebb and others, might occur. 
For example, the successive firing of 
two neurons which abut each other 
might result in a change in the protein 
structure of the membranes or terminal
vesicles of the synapse such that
the sensitivity of the second to stimu- 
lation by the first might be increased. 
The process involved could, for in- 
stance, be one of enzyme induction, in 
somewhat the manner proposed by 
Briggs and Kitto (1962) and Smith 
(1962). The mechanism of incorpo- 
ration by electrophoresis would pro- 
vide the necessary selective mecha- 
nism by which synaptic resistance 
would be lowered only between suc- 
cessively fired adjoining neurons. 
Given this basis for neural “associa- 
tion,” the phenomena of learning and 
memory might be accounted for by the 
same sorts of theoretical machinery 
already so ably elaborated by such 
neural circuit designers as Hebb 
(1949) and Pitts and McCulloch 
(1947). However, in line with the 
general goal of this paper—to suggest 
alternative explanations to those now 
current—the temptation to follow well- 
worn pathways will be eschewed. 
Rather, a more radical hypothesis will
be proposed, which seems to provide
for a number of the “sophisticated”
properties of the brain on the basis of
a relatively simple mechanism.

The hypothesis is suggested in part
by the fact that it is actually possible
to observe electrical events associated
with performance of conditioned responses
with a fair degree of regularity
and detail (e.g., John & Killam,
1960). If conditioning were solely a
matter of the alteration of synaptic 
connections between individual pairs 
of neurons, the probability of observ- 
ing activity specific to any one reac- 
tion would be very small indeed, as 
elements involved only in a specific 
reaction would be at least as small a 
proportion of the total brain as the 
conditioned reaction was of the 
animal’s behavioral repertoire. Of 
course, electrical “signals” in the 
higher centers of the central nervous 
system are not confined to the firing 
elements, but, since neurons in these 
regions are unmyelinated, spread readily
into surrounding tissues. Thus,
recording electrodes need not actually 
be on those neurons which are 
imagined to have synaptic relations 
with each other in order to record elec- 
trical events generated by the elabora- 
tion of a conditioned reaction. How- 
ever, it is hard to believe that this 
spread of electrical activity, and the 
perhaps specialized unmyelinization 
which allows it, serve no other func- 
tion than to make the electrophysiolo- 
gist’s job easier and save a little space 
inside the skull case. 

The suggested alternative is that
conditioning consists not in decreased
resistance of the synapse between one
neuron and another, but rather in a
change of the sensitivity of a given
neuron to some characteristic of the
spreading electrical activity generated
by the CS.

It is proposed that individual neural 
cells selectively propagate and are ex- 
cited by alternating currents of spe- 
cific frequencies. As will be discussed 
below, neural membranes have certain
electrical properties which could serve
this function. In turn, the electrical
properties of the membrane (or its
“ghost” ionic envelope) are very likely
a function of the structure of its protein
layers, and these are in turn a
function of the species of RNA in- 
volved in their synthesis. Thus, fre- 
quency specific sensitivity of the 
neural cell membrane provides an at- 
tractive possibility as a mechanism for 
signal selection based on biochemical 
structure. 

Imagine that the membrane of each 
neuron (and its surrounding glia) 
constitutes an electronic filter tuned to 
some reasonably narrow band of fre- 
quencies. Then with conduction of 
an impulse the currents which spread 
from the neuron into surrounding tis- 
sues might have characteristic fre- 
quency patterns, probably represented 
in the case of a single impulse as the 
shape of the spike potential. These 
spreading potentials would then pene- 
trate other neurons selectively, as de- 
termined by the frequency specific 
admittance characteristics (tuning) of 
their membranes. If a given neuron 
were tuned to just that band of fre- 
quencies which was present, the result 
would be at least partial depolarization 
of its membrane. Even if this de- 
polarization were insufficient to fire the 
neuron, it might bias it to fire when 
provided with additional sources of 
excitation, e.g., synaptic chemical ex- 
citation from other neurons and/or 
potentials arising from intrinsic brain 
rhythms. 

If the tuning of a given membrane 
is determined by the types of proteins 
of which it is constructed, then pas- 
sage of transforming RNA from glia 
to a neuron might result in a change 
in its resonant frequency. If this 
change increased the probability of the 
neuron firing when exposed to fre- 
quencies characteristic of the CS, con- 
ditioning would have occurred. 
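
The tuning idea can be made concrete with a toy model. The following sketch is purely illustrative and is not a model proposed in the text: the Lorentzian response curve, the numerical frequencies, the threshold, and the names `TunedMembrane`, `response`, `fires`, and `retune` are all assumptions chosen for the example. It shows a membrane that is partially depolarized by signals near its resonant frequency, that can be biased to fire by additional excitation, and that comes under control of a CS frequency once its resonant frequency is shifted.

```python
from dataclasses import dataclass


@dataclass
class TunedMembrane:
    """Toy frequency-selective membrane (all parameters hypothetical)."""

    center_hz: float        # resonant frequency of the membrane
    bandwidth_hz: float     # half-width of the pass band
    threshold: float = 0.5  # total excitation needed to fire

    def response(self, signal_hz: float) -> float:
        """Relative depolarization, between 0 and 1, for a given frequency.

        A Lorentzian curve is assumed here simply because it peaks at the
        resonant frequency and falls off smoothly on either side.
        """
        detuning = (signal_hz - self.center_hz) / self.bandwidth_hz
        return 1.0 / (1.0 + detuning * detuning)

    def fires(self, signal_hz: float, extra_excitation: float = 0.0) -> bool:
        """True if tuned depolarization plus other excitation reaches threshold."""
        return self.response(signal_hz) + extra_excitation >= self.threshold

    def retune(self, new_center_hz: float) -> None:
        """Stand-in for a change in membrane protein following RNA uptake."""
        self.center_hz = new_center_hz


if __name__ == "__main__":
    cs_hz = 400.0  # hypothetical frequency characteristic of the CS
    neuron = TunedMembrane(center_hz=150.0, bandwidth_hz=20.0)
    print("fires to CS before retuning:", neuron.fires(cs_hz))
    neuron.retune(cs_hz)
    print("fires to CS after retuning:", neuron.fires(cs_hz))
```

A neuron tuned slightly off the signal frequency gives a partial response, which corresponds to the case in which the depolarization is insufficient by itself but biases the cell to fire when other excitation is added.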

Since each neural cell seems to have 
a unique biochemical composition 
(Sperry, 1951), it is conceivable that 
the membrane of each cell has a unique
natural frequency or set of frequencies
(a particular shape of pulse) and thus, 
in effect, is able to broadcast its genetic 
“code name.” If a neuron which ini- 
tially was unresponsive to the signal 
generated by firing of the neurons ac- 
tivated by a CS were “tuned” to re- 
spond to it, the functions of that 
neuron would come under the control 
of the CS. But, by Hypothesis I, the 
only neurons which can become so 
tuned are those which fire shortly after 
the CS, i.e., those initially under con- 
trol of the US. Thus the CS would 
become a substitute for the US in 
eliciting neural activity. 


INTERLOCKING OF HYPOTHESES
I AND II


For the next step in the process, it 
becomes necessary to simply assume 
that the presence of particular FM sig- 
nals results in the release of just those 
species of RNA which, if incorpo- 
rated, cause a neuron to form mem- 
branes sensitive to the same frequen- 
cies. This, of course, leaves an 
important problem unexplained, since 
it is crucial to the theory that a 
mechanism for such RNA specifica- 
tion exist. How can a given frequency 
of electrical activity result in availabil- 
ity of RNA which breeds membranes 
sensitive to that frequency? Hydén
(1962) has proposed that “instructional”
specification of new nucleotide
arrangements is induced by
ionic fluxes, although Briggs and Kitto
(1962) and Smith (1962) have objected
that there is no chemical precedent
for such a process. The alternative
of a “selective” process is
appealing in the present context, since
it can be imagined that the glia are
capable of providing a wide array
of types of RNA unlike those of the
neuron (their average base ratios are,
indeed, different according to Hydén
and Egyhazi, 1962). Selection among
these might proceed through stimulation
of synthesis of resonating molecules,
destructive selection against [...]
membrane permeability. In any case,
if appropriate RNA types were released
by glia during, or shortly after,
the passage of a frequency coded signal
representing the CS, and subsequently
were absorbed by those neurons which
fired, the appropriate neurons might
be transformed so as to become more
likely to fire in the presence of the CS.


[...] of RNA is turned on and off is a deep
enough mystery at present that, it seems to
me, no conjecture ought to be considered
unreasonable simply for lack of precedent.
It seems at least conceivable, for example,
that some sort of principle involving electrical
and/or mechanical resonance of molecules
which block parts of DNA chains
could be involved.


In summary, the elaboration of a
CS-US association is hypothesized to
occur as follows. A CS causes a
unique pattern of impulses in neurons
of the cortical and subcortical projections
of the afferents which it excites.
Since each of these projection neurons
has a unique biochemical “name,” it
propagates a spreading potential with
characteristic ac frequency components.
The compound FM signal thus
produced is distributed widely in the
brain, particularly in the “association”
areas of the frontal and temporal
poles, following the paths of least electrical
resistance, e.g., flowing through
the neuroglia. Glial cells throughout
the regions reached by these signals
are then, somehow, induced to make
available RNA types which can produce
membranes which respond to the
same frequencies. These RNA molecules
are presumed to accumulate at
the glial-neural junctures during the
CS-US interval. A US occurring at
this point in time causes a certain
number of neurons to fire, and the
reversed electrical field of the action
potential results in the electrophoretic
migration of new RNA into these
neurons. This RNA is then assumed
to “transform” the protein synthesizing
apparatus of the incorporating
neurons in such a way that their membranes
become sensitive to the FM signals
which represent the CS. Thus,
on subsequent occurrences of the CS
these neurons are more likely to fire.
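
The sequence just summarized can be sketched as a small simulation. This is a schematic illustration only, under loudly stated assumptions: the one-second lifetime of CS-specific RNA (`RNA_WINDOW_S`), the representation of tuning as a set of signal labels, and the names `Neuron` and `run_trial` are hypothetical and do not come from the text. The sketch captures one consequence of the two hypotheses taken together, namely that only a neuron fired by the US shortly after the CS becomes tuned to the CS, so that forward pairing conditions while simultaneous or backward pairing does not.

```python
from dataclasses import dataclass, field

# Hypothetical lifetime of CS-specific RNA at the glial-neural juncture.
RNA_WINDOW_S = 1.0


@dataclass
class Neuron:
    """Toy neuron whose membrane responds to a set of labelled signals."""

    name: str
    tuned_to: set = field(default_factory=set)

    def responds_to(self, signal: str) -> bool:
        return signal in self.tuned_to


def run_trial(neuron: Neuron, events: list) -> None:
    """Apply one trial to the neuron.

    `events` is a list of (time_s, kind, signal) tuples, with kind "CS" or "US".
    A CS makes glia release RNA specific to its signal. A US that fires the
    neuron while that RNA is still available (strictly after the CS and within
    RNA_WINDOW_S) causes the RNA to be incorporated, tuning the neuron to the CS.
    """
    pending_rna = []  # (release_time_s, cs_signal)
    for time_s, kind, signal in sorted(events):
        if kind == "CS":
            pending_rna.append((time_s, signal))
        elif kind == "US" and neuron.responds_to(signal):
            for release_time_s, cs_signal in pending_rna:
                if 0.0 < time_s - release_time_s <= RNA_WINDOW_S:
                    neuron.tuned_to.add(cs_signal)


if __name__ == "__main__":
    forward = Neuron("forward", {"shock"})
    run_trial(forward, [(0.0, "CS", "tone"), (0.5, "US", "shock")])
    print("forward pairing conditions:", forward.responds_to("tone"))

    backward = Neuron("backward", {"shock"})
    run_trial(backward, [(0.0, "US", "shock"), (0.5, "CS", "tone")])
    print("backward pairing conditions:", backward.responds_to("tone"))
```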


SOME IMPLICATIONS AND
RAMIFICATIONS


It may be useful to sketch the 
nature of a few major implications 
which these hypotheses would have for 
overall brain theory. One is that the 
function of the brain would depend 
not only on the neurons, but also on 
the neuroglial cells, as Galambos 
(1961) has proposed. Galambos’ rea- 
sons for suspecting glia of more im- 
portant functions than mere meta- 
bolic and mechanical support include 
the numerical preponderance of glial 
cells in the brain and the phylogenetic 
increase in the ratio of glia to neurons. 
The role proposed here for glia would 
be to provide preformed “programs,” 
to use Galambos’ language, bearing in- 
structions to the absorbing neuron to 
respond to events representing certain 
cerebral inputs. 

A second broad implication is that
“association areas” need not be or-
ganized in a highly discrete manner. 
The envisaged process would occur 
most effectively in the densely packed, 
unmyelinated jungle of apical dendrites 
of the cerebral cortex where spreading
impulses can reach large numbers of
cells. The process involves essentially
ephaptic (or, following Galambos,
“gliatic”) rather than synaptic 
changes, and thus might be consider- 
ably restricted by myelinization. Thus, 
also, “conditioning” should be possible 
in any part of the nervous systems 
which contains densely packed un- 
myelinated neurons surrounded by 
glia (Chamberlain et al., 1963). The 
particular role of projection cortex im- 
plied by the theory is contained in its 
exquisite display in anatomical order 
of impulses arising from sensory stim- 
ulation. It transduces differences 
along stimulus dimensions to differ- 
ences in anatomical location. The 
anatomical order, in turn, probably 
must be based on a biochemical dif- 
ferentiation, since it is retained in re- 
generation (Attardi & Sperry, 1963) 
and thus, on the present hypothesis, 
gives rise to differences in spreading 
signals which are characteristic of 
sensory inputs. For association all 
that is required structurally is that 
elements involved in response have 
rich connections with areas which also 
have rich connections with sensory 
projections. No more specific “path- 
way” organization is required, since 
“pathways” need consist only of con- 
ducting media which will allow the 
spread of FM signals. 

The picture of neural interaction 
which is implied is somewhat different 
from traditional doctrine limited to 
neuron-neuron synapses mediated by 
chemical transmitters. However, it is 
not as radical a departure as it might 
seem at first glance. The major dif- 
ference lies in utilization of the vast 
array of membrane—membrane juxta- 
positions found within the grey areas 
of the CNS which are not composed 
of two neurons, i.e., the neuron-glia 
and glia-glia interfaces. In a sense 
this view is more consistent with [...]

[...] or group of neurons could become
conditioned. Generalization to nearby
frequencies, and to harmonic and beat
frequencies could be expected, and
might form the basis of a
“categorization” of diverse inputs.
Again, unspecialized cells which fired
more or less randomly after
events or before responses could come
to serve as relay stations, which might
mediate between stimuli and responses
and thus act as percepts or even
“intervening variables.”


Additional complexity and flexibility 
would be provided by the classical 
neuron-neuron synaptic mechanisms.
While tuning provides a recognition or
coding mechanism, it does not by itself
provide adequate combinatorial or [...]
mechanisms. For example, it
could be hypothesized that inborn, 
perhaps unmodifiable neuron-neuron 
synapses of the classical varieties make 
possible such features of neural activity.
For example, tuning of a
neuron which is genetically specialized 
as an inhibitor of a given response 
would produce an inhibitory CS. In 
a similar manner the conditioning of 
combinations of neurons with “natu- 
rally” convergent paths to different 
stimuli would allow synaptic summations
to produce “novel” responses.
If it is imagined that the regions in 
which conditioning occurs contain a 
wide variety of neurons, some excita- 
tory, others inhibitory, some with 
many connections, others with few, 
then the combinatorial logic for infor- 
mation processing made available by 
allowing each to be selectively respons- 
ive to particular coded inputs (includ- 
ing signals generated by interneurons) 
becomes almost unlimited. 

With elaboration, the hypotheses 
might also suggest new interpretations 
of such phenomena as consolidation,
operant reinforcement, and discrimination—in
fact, of course, of all the
functions attributable to plastic prop- 
erties of the nervous system—but an 
attempt to deal with all of these here 
obviously would be premature. Con- 
sequently, the foregoing sample of im- 
plications will be assumed to suffice to 
indicate some of the theoretical directions
in which these hypotheses might
lead.


SOME VERIFICATIONS


The ideas advanced here are testable 
on two different levels: directly, by 
investigation of the cellular and molec- 
ular events which are hypothesized, 
and indirectly, by study of the physiological
and behavioral consequences
which they imply. A few instances
will be outlined in which evidence al- 
ready exists. 


Direct Evidence 


The findings of Hydén and his collaborators
(Hydén & Egyhazi, 1962,
1963), which have already been mentioned,
indicate that learning involves a
change in the RNA of neurons and
their surrounding glia, such that the
glial cells lose RNA, the neurons gain
RNA, and the variety of RNA in the
glia and neurons is changed in a similar
way. In rats which learned to climb
a string to a food platform, increases
in RNA were found in neurons of 
the vestibular nucleus and other di- 
rectly connected neurons, but not in 
other parts of the brain. This increase 
in neuronal RNA, and an accompany- 
ing decrease in glial RNA was also 
found in control rats which received
passive vestibular stimulation by centrifugation.
However, changes in the
base ratio (coding) of the RNA were 
found only in the vestibular neurons 
and glia of trained rats. All of these 
findings would be expected if a given 
type of RNA molecule were produced 
in glia and then transferred to the 
neuron. However, to prove that actual 
migration across the membrane during 
conduction is responsible for these
changes will require further careful
investigation, perhaps by use of tracer 
experiments. 
The question as to whether the
RNA change is associated with a
change in membrane constitution is [...]
protein types, e.g.,
lipoproteins, which occur primarily in
the membrane by electrophoretic or
immunochemical techniques, before
and after learning, would be one way
of searching for such changes.
There does not seem to be any direct 
evidence available relevant to Hy- 
pothesis II. However, it should not 
be impossible to detect selective sensi- 
tivity of cortical elements to specific 
frequencies of alternating currents or 
to detect differences in the frequency 


(1941) gives an equivalent circuit, 
based on ac bridge measurements, for 
the squid axon in which the membrane 
acts like a capacitor in parallel with 
a series resistor, inductance, and rectifier.
Such a circuit has a natural
or resonant frequency $f_0 = 1/(2\pi\sqrt{LC})$.
When the resistance is less
than $\sqrt{4L/C}$, the current through the
circuit oscillates at $f_0$, and when the
resistance is much greater than this
value, the circuit is overdamped at $f_0$.
The rectifying properties of the mem- 
brane are such that resistance to in- 
ward current is approximately one 
hundred times that for outward 
(which, by the way, would prevent 
dc interneuronal transmission without 
chemical mediation). Thus, with the 
proper choice of values for each of 
the elements it would be possible to 
construct a circuit which would act
either as an oscillator (when a dc potential
was applied) or as a filter for
ac currents. Outward flowing pulses
would be given a particular shape,
while arriving pulses of the same
shape would generate the largest voltages
across the circuit (and perhaps
produce depolarization contributing
to a self-sustaining neural impulse).
Thus, neural membranes are known to
have properties which could make the
broadcast of the “name” of a cell and
the recognition of that “name” by a
second cell, and thus Hypothesis II,
possible. However, there is no evidence
that neural cells do in fact function
in this fashion or that such
properties are related to learning. 
While frequencies of stimulating cur- 
rents which produce minimums in 
threshold values have been observed 
for peripheral nerve and muscle fibers, 
and for central stimulation of certain 
kinds (Curtis & Cole, 1950; Rashev- 
sky, 1950), the optimum frequencies 
in these cases being in the range of 
about 10 to 1,000 cps, differential 
optima for different neurons or regions 
do not seem to have been reported. It 
is conceivable that the morphologically 
specialized neurons of the cortex have 
membranes with extremely high reso- 
nant frequencies, perhaps in the high 
audio or RF ranges, due to very low 
inductance values. (It has even been 
suggested that the molecular structure
[...] constitutes a
solid state device capable of propagating
coherent electromagnetic energy
in the VHF or IR range; Fernandez- 
Moran, 1962.) If such were the case, 
it would not be too surprising that fre- 
quency tuning had gone unnoticed. 
Nonetheless, in the present state of 
knowledge, a frequency specific tun- 
ing mechanism in the brain must re- 
main purely hypothetical. 
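
The dependence of resonant frequency on the circuit elements, and in particular the suggestion that very low inductance would push the resonance to high frequencies, can be illustrated numerically. The sketch below treats the membrane as a simple loop of resistance, inductance, and capacitance and ignores the rectifier; the component values are hypothetical and chosen only for illustration, not taken from measurements. The damping criterion used is the standard textbook condition for such a loop, R < 2·sqrt(L/C), which is an assumption about how the criterion in the text should be read.

```python
import math


def resonant_frequency_hz(inductance_h, capacitance_f):
    """Natural frequency f0 = 1 / (2 * pi * sqrt(L * C)), in cycles per second."""
    return 1.0 / (2.0 * math.pi * math.sqrt(inductance_h * capacitance_f))


def critical_resistance_ohm(inductance_h, capacitance_f):
    """Resistance at which the loop is critically damped: 2 * sqrt(L / C)."""
    return 2.0 * math.sqrt(inductance_h / capacitance_f)


def is_oscillatory(resistance_ohm, inductance_h, capacitance_f):
    """True if the loop is underdamped and therefore rings near f0."""
    return resistance_ohm < critical_resistance_ohm(inductance_h, capacitance_f)


if __name__ == "__main__":
    # Hypothetical element values for illustration only.
    capacitance = 1e-6     # farads
    inductance = 0.2       # henries
    print("f0 (cps):", resonant_frequency_hz(inductance, capacitance))
    print("critical resistance (ohms):",
          critical_resistance_ohm(inductance, capacitance))

    # Reducing the inductance by a factor of 100 raises f0 by a factor of 10.
    low_inductance = inductance / 100.0
    print("f0 with low inductance (cps):",
          resonant_frequency_hz(low_inductance, capacitance))
```

Because the frequency varies as the inverse square root of the inductance, a resonance in the high audio or RF ranges would require inductance values several orders of magnitude below those that give optima in the range of 10 to 1,000 cps.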


Indirect Evidence 


A number of well-known but hard 
to explain phenomena of brain psycho- 
physiology seem comprehensible on the 
basis of the present hypotheses. For
example, the small size of the proposed
memory record—one or a few cells— 
and the nonspecificity of its pathways 
would give the system the extensive 
insensitivity to damage classically de- 
scribed by Lashley (1960). Large 
areas might have to be removed before 
any of the few cells involved in a particular
habit were affected, and “connections”
would not necessarily be severed
by a scalpel’s slice (or even by a
gold or mica sheet which could be 
penetrated or by-passed by an ac 
field). The small area used for rep- 
resentation of an association would 
also give the brain the immense stor- 
age capacity required for multiple 
representation, which would provide 
additional indifference to insult. 

One of the most striking features 
of brain function is the very marked 
temporal ordering of memory, which 
is seen not only in the facts of classical 
conditioning already mentioned, but 
also in the predominance of forward 
associations in human verbal learning 
(e.g., Raskin & Cook, 1937), and in 
such common experiences as inability 
to recall the previous letter in an alpha- 
betical sequence (what comes before 
p?) without starting at an earlier let- 
ter and reciting forward. Just such 
a linear quality of memory, of course, 
follows from Hypothesis I. 

On a more “molecular” level it is 
interesting to consider the relation of 
the proposed mechanism to the recent 
findings of McConnell (1962) and 
Corning and John (1961) of what ap- 
pears to be purely biochemical transfer 
of memories in planaria. Following 
a report by McConnell, Jacobson, and 
Maynard (1959) that planaria retain 
conditioning after being transected and 
allowed to regenerate, Corning and 
John (1961) showed that only the 
head end retained the habit if regen- 
eration took place in a medium con- 
taining ribonuclease, an enzyme which 
decomposes RNA. More recently 
McConnell (1962) has reported that 
digestion by untrained planaria of 
trained planaria resulted in savings in 
conditioning, and Zelman et al. (1963) 
have obtained evidence suggesting 
similar results from injection of an 
RNA extract of conditioned animals. 
It is obviously impossible to state, on 
the basis of the present theory, exactly 
how such a phenomenon would occur. 
However, one speculation might be 
that allowing altered cells to direct 
resynthesis of lost cells, as in regen- 
eration, or feeding altered cells to an- 
other organism as in the “cannibalism” 
studies might “infect” the new animal 
with RNA which would make its 
neurons more sensitive to the CS. 
Only generalized “sensitization” would 
be expected, but this is not too im- 
plausible an explanation since the 
response was a very generalized con- 
traction involving a large portion of 
the animal’s effectors. Insofar as sav- 
ings on relearning rather than direct 
transfer occurred in the cannibalism 
experiment, the diet of sophisticated 
worms may have provided only extra 
CS-coded RNA, which had to be in- 
corporated by firing US-controlled 
neurons before it became functional 
for its consumers. 
Finally, if the specification of signal 
(CS) sensitivity is carried by more- 
or-less freely migrating RNA mole- 
cules, a blood-brain barrier makes a 
good deal of adaptive sense. Not only 
could there be a need to prevent ex- 
traneous RNA from reaching neurons, 
but also there could be a need to keep 
the peculiar “coded” molecules of the 
brain out of the general circulation 
where they might induce immune 
reactions. 


SUMMARY AND DISCUSSION 


Two hypotheses concerning a pos- 
sible molecular basis of memory have 
been offered. The first is that the 
basic event of conditioning or learning 
is the transfer of RNA molecules 
from surrounding glial cells to con- 
ducting neurons and subsequent trans- 
formation of the protein synthesizing 
apparatus of the neuron. The second 
is that information is coded for stor- 
age in the CNS in the form of fre- 
quency characteristics of spreading 
ac potentials to which the membranes 
of neurons can be tuned. 

The notion of a chemical interaction 
between glia and neurons is based in 
part on the recent findings of Hydén 
(Hydén, 1961; Hydén & Egyhazi, 
1962, 1963), and is at least vaguely in 
line with theoretical suggestions of 
Galambos (1961), while the idea of 
coding in terms of resonant frequen- 
cies is reminiscent of the theory of 
“modulation” and “resonance” once 
proposed by Weiss (1952) to account 
for developmental innervation. The 
suggested mechanism by which trans- 
fer of RNA occurs selectively to active 
(US-UR) neurons is electrophoretic 
migration in the potential gradient of 
the action potential. Transformation 
of the incorporating neuron by the 
RNA is assumed to be somewhat 
analogous to that in viral or bacterio- 
phage infection (Williams, 1959). 
Selective tuning is suggested as pos- 
sibly based on the electrical reactance 
and rectifying properties of the mem- 
brane as described by Cole (1941). 
No mechanism is suggested for the 
specification of the RNA type to be 
transferred. The lack of such a 
specification mechanism would, of 
course, represent a critical flaw in the 
theory, even in the event that all the 
other postulated mechanisms were cor- 
rect. Principally to make this lack 
clear, the proposal has been stated in 
terms of two separate and “freestand- 
ing” hypotheses, instead of in the guise 
of a unified theory. 

It is obvious that these proposals 
are highly speculative. There is no 
direct evidence that the brain uses the 
mechanisms suggested, and the indi- 
rect evidence is consistent rather than 
compelling. However, some of the 
ideas are quite different from current 
conceptions of brain mechanisms in 
learning, and it is hoped, therefore, 
that they will be of some interest. 


PERSONAL CONSTRUCTS, RULES, AND THE LOGIC 
OF CLINICAL ACTIVITY 1 


T. MISCHEL 
Colgate University 


The first part of this paper examines Kelly’s Psychology of Personal 
Constructs and argues that constructs are not like hypotheses used to 
predict what will be done, but are like rules used to decide what should 
be done. The foregoing distinction is used to clarify the nature of 
personal constructs and their relation to behavior. The second part of 
the paper examines the logic of explanations and predictions based on 
such constructs. It argues that what is done by the personal construct 
clinician requires a “logical reconstruction” which differs sharply from 
the one provided by Meehl. The nature of explanations based on per- 
sonal constructs, the way they can be empirically justified, and the 
relation of such explanations to predictions are analyzed. 


ARE CONSTRUCTS PREDICTIVE 
“HYPOTHESES”? 


The Fundamental Postulate of 
Kelly’s (1955) theory is this: “A per- 
son’s processes are psychologically 
channelized by the way in which he 
anticipates events [p. 46].” His the- 
ory thus explains behavior in terms of 
an “anticipatory rather than a reactive 
system [p. 170].” Kelly’s fundamental 
point is: 
that human behavior may be viewed as basi- 
cally anticipatory rather than reactive, and 
that new avenues of behavior open them- 
selves to a person as he reconstrues the 
course of events surrounding him. Thus a 
thoughtful man is neither the prisoner of his 
environment nor the victim of his biography. 

The patterns of man’s construction are 
called constructs; and since each person sets 
up his own network of pathways leading into 
the future, the concern of the psychologist is 
the study of personal constructs [p. 560]. 


These constructs are, allegedly, like 
hypotheses in that a man using con- 
structs is like a “scientist, ever seeking 
to predict and control [p. 5] . . . the 
constructs one places upon events are 
working hypotheses, which are about 
to be put to the test of experience [p. 
72].” Indeed, “constructs are hypothe- 
ses. Prediction is the goal [p. 940].” 

1 The author wishes to express his grati- 
tude to the Colgate Research Council for a 
grant which enabled him to complete this 
study. 

Just what is a construct and how is 
it used to predict? Kelly (1955) holds 
that constructs are “ways of construing 
the world [p. 9]” and “by construing 
we mean ‘placing an interpretation’. 
. . . In construing, the person notes 
features in a series of elements which 
characterize some of the elements and 
are particularly uncharacteristic of the 
others. Thus he erects constructs of 
similarity and contrast [p. 50].” Fear 
versus domination is one of Kelly’s 
(1955) illustrations of a construct: a 
man who uses this construct divides 
the people he knows on the basis of this 
similarity-contrast dimension into those 
he is afraid of and those he can domi- 
nate (pp. 81-82). 

Let us apply this to a particular 
instance of behavior; I insult my boss. 
Just how does this action involve a 
prediction? How is it “channelized” 
by the constructs I use to anticipate 
events? Kelly would say that if I 
construe people along the fear-domina- 
tion axis, then I may perceive the boss 
as similar to some other people (e.g., 
my father), the basis of the similarity 
being that they tend to dominate others 
and are thus different from those who 
live in fear of others (the contrasting 
pole of my construct). I will, there- 
fore, expect the boss to try to dominate 
me. If I construe my own role on the 
domination side of this construct, I 
may decide to insult him so as to show 
him that I cannot be dominated, I am 
someone he needs to fear. In this way 
my action has been “psychologically 
channelized” by the way I anticipate 
events. More generally, we can un- 
derstand what people do in terms of 
their personal constructs because if 
people construe situations differently 
“they will anticipate them differently 
and will behave differently as a con- 
sequence of their anticipations [p. 90].” 

The question I want to examine is 
whether there is a significant analogy 
between the process of construing 
which leads me to insult the boss and 
the use of hypotheses to make predic- 
tions. 

1. The way I construe the situation 
may lead me to think “I will insult the 
boss because he will try to dominate 
me if I don’t show him what’s what.” 
But this is not a prediction of what I 
will do any more than “I will take an 
aspirin because I have a headache” is 
a prediction. For it would be absurd 
to think that in such cases I infer what 
I am going to do from evidence which 
follows the “because.” I am not offer- 
ing the fact that I have a headache, or 
think the boss will dominate me, in 
support of a prediction which I am 
making about my own future behavior. 
In such contexts “I will” does not pre- 
dict what I am going to do, but ex- 
presses my intention (decision) to do 
it; what follows the “because” is not 
evidence, but a reason which I give to 
justify what I am going to do as the 
appropriate thing to do. 

2. Does the way I construe the boss 
lead me to predict what he will do? 
If I construe the boss as a dominating 
type I will, of course, expect him to 
behave in dominating ways. But this 
does not suffice to show that I am mak- 
ing a prediction. Pollyanna “expects” 
every cloud to have a silver lining, Mr. 
Smith “expects” to be home for din- 
ner, the Mad Hatter in Alice “expects” 
treacle to be drawn out of a treacle well, 
and the physicist “expects” freely fall- 
ing bodies to be uniformly accelerated 
—but they are doing different things 
and nothing but confusion can result 
from lumping their activities on the 
ground that all of them “expect” some- 
thing. If the analogy between con- 
structs and predictive hypotheses is not 
to be highly misleading, then there 
must really be a significant similarity 
between what I do when, on the basis 
of my constructs, I anticipate certain 
behaviors from another and what a 
scientist does when, on the basis of a 
hypothesis, he makes a prediction. 

The following considerations bring 
out some radical differences between 
these activities. 

(a) My anticipation of the boss’ be- 
havior tends to be a self-fulfilling 
prophecy. Because I construe him as 
dominating I insult him, and this is 
likely to make him do just what I 
“predicted” he would do when I con- 
strued him as dominating. But scien- 
tific predictions have no tendency to- 
wards self-fulfillment; what I do in 
predicting an eclipse in no way tends 
to make it happen. 

(b) The validation or invalidation 
of anticipations based on constructs is 
“subjectively construed.” Thus 


a person may anticipate that he will fall 
down the stairs and break his leg. If his 
prediction turns out to be true, or at least 
if it seems to him that he has fallen down 
the stairs and broken his leg, he experiences 
validation [pp. 158-159, 13; author's italics]. 


This subjective validation of anticipa- 
tions based on constructs differs 
sharply from the objective verification 
of predictions. Facts about which 
there is intersubjective (i.e., objective) 
agreement determine whether the 
eclipse I predicted did, or did not, 
occur. But suppose I have construed 
the boss as hostile, he asks me to work 
overtime, and I construe this as valida- 
tion. Was the request in fact hostile? 
The answer depends on the boss’ inten- 
tion—his request was not hostile un- 
less he intended (consciously or not) 
to attack me in making it—and thus 
involves not merely construing but con- 
struing how he construes. Now when 
I construed him as hostile I made a 
decision about the kind of person he is; 
I am, therefore, “set” to construe what- 
ever he does as an attempt to dominate. 
Even a conventional “How are you?” 
is something I could construe as having 
hostile intent—e.g., he is trying to show 
up my weakness by making me admit 
that I have not been feeling well. That 
such “validation” lacks objectivity is 
clear. What is not clear is how there 
could ever be something the boss does 
that could not be subjectively con- 
strued as validation. Is even a sub- 
jective invalidation of constructs pos- 
sible? 

(c) Kelly claims it is because a man 
“can usually assess the outcome of his 
predictions at a different level of con- 
struction from that at which he origi- 
nally makes them [p. 13].” What he 
means is that since “no construct ever 
stands entirely alone [p. 304],” it may 
happen that, though I originally con- 
strued the boss as dominating, I cannot 
construe what he proceeds to do as 
dominating because to do so would con- 
flict with some other constructs I use. 
In other words, such a construal would 
not fit into the “interweaving of con- 
strued likenesses and differences [p. 
304]” involved in my system of 
constructs. 

But would this invalidate my con- 
struct? Here it must be remembered 
that a construct is a dichotomous cate- 
gory involving both similarity and dif- 
ference, e.g., fear versus domination. 
And this construct would not be invali- 
dated when what the boss proceeds to 
do cannot be subjectively construed as 
an attempt to dominate. For as Kelly 
himself points out, what a person is 
likely to do in such a case is “to re- 
verse himself along the dimensional 
lines which have already been es- 
tablished [pp. 128-129, 133-134].” 
Though my “prediction” has been sub- 
jectively invalidated, I am likely to 
retain the construct on which it was 
based and to use it now to construe 
the boss as someone who is afraid. 
Suppose this construal in turn is sub- 
jectively invalidated by what he pro- 
ceeds to do. Will this finally force 
me to reject the construct as incon- 
sistent with the facts? No again. For 
in that case I can “constrict” the con- 
struct, render it less “permeable,” and 
thus exclude the boss from its “range 
of convenience [pp. 477, 79-80, 
77, 137].” Thus it is not logically 
possible for the facts ever to falsify 
my construct—not even subjectively. 
Surely this is why an experimental 
study of the effects of interpersonal 
predictions on changes in personal con- 
structs concludes that the 


findings with respect to the effect of invali- 
dation on construct change are inconclusive 
... [there was a] failure of invalidation to 
show a statistically significant relation to 
change in constructs [Poch, 1952, pp. ...]. 

Since constructs cannot be falsified 
(not even subjectively) by what in fact 
happens, they differ fundamentally 
from predictive hypotheses which can 
be. 

3. Another fundamental difference 
between construing and predicting be- 
comes clear when we note that “hos- 
tile,” “dominating,” “mature,” “gen- 
tle,” and other poles of personal con- 
structs are not only descriptions of 
behavior, they are also evaluations. 
My construal of the request to work 
overtime as hostile is not value-neu- 
tral; to construe the boss as dominating 
is not like perceiving something as an 
apple, but like perceiving it as a 
bad apple. 

If construing did not involve evalua- 
tion it could not channelize behavior. 
For my prediction that the boss will 
try to dominate me can channelize my 
behavior towards him only insofar as 
I value or disvalue the anticipated out- 
come. If I don’t care one way or the 
other about dominating or being domi- 
nated, then my anticipation that he 
will dominate me gives me no reason 
for acting one way rather than another. 
So if my behavior is to be psycho- 
logically channelized by the way I con- 
strue, then my construals must involve 
an evaluation of what I am construing. 
But a prediction, as such, never evalu- 
ates what it predicts. 


PREDICTIONS AND HYPOTHESES VERSUS 
DECISIONS AND RULES 


I have argued that using constructs 
is not like using hypotheses to make 
predictions. What leads Kelly to 
think it is? He claims that the girl 
who “anticipates” marriage to a man 
having certain characteristics, the old 
maid who “predicts” a husband that 
never comes along, and the navigator 
who predicts his arrival at the North 
Pole are all doing the same sort of 
thing—they are using constructs to 
predict the future. According to 
Kelly, what each of them predicts is 
the “common intersect of a certain set 
of properties,” and the only difference 
between them is that in the case of the 
old maid “there are altogether too 
many dimensions involved and nobody 
ever lands on the precise point where 
all of them converge. Her long-stand- 
ing anticipation is never fulfilled [p. 
121].” This is why Kelly thinks that 
the behavior of the girl and the old 
maid is like that of the navigator—a 
matter of using “hypotheses” (con- 
structs) to anticipate the future. 

But this confuses entirely different 
things. Though the navigator may 
say “I ought to (will, should) see a 
place having such characteristics,” and 
this sounds very much like the girl 
saying “I ought to (will, should) 
marry a man having such characteris- 
tics,” what they mean is different. For 
the navigator’s “ought” is predictive 
while the girl’s “ought” is prescriptive. 

Consider a typical prediction: “I 
should see an eclipse at such a time.” 
Certain facts (i.e., planetary positions 
and dimensions), together with certain 
laws (i.e., Kepler’s), are the basis for 
an inference about when the moon will 
pass into the earth’s shadow to produce 
a lunar eclipse. We are predicting the 
time of the eclipse by predicting the 
future positions of the moon and the 
earth from their present ones, and the 
“connection” between these data is ef- 
fected by laws formulating uniformities 
in planetary behavior. When we are 
not fully confident of the uniformity on 
which our prediction is based, we call 
it a hypothesis rather than a law 
and wait for the facts to confirm or 
refute it. 

In predicting, I thus describe what 
will happen by drawing an inference 
on the basis of some laws (hypotheses). 
These laws describe, e.g., planetary be- 
havior, but they do not evaluate it. 
They are not directions laid down to 
guide the planets towards the “right” 
motions; the planets did not “decide” 
to follow these laws, nor could they 
decide to disobey them. When Kep- 
ler first formulated his hypotheses and 
used them to predict planetary motions, 
this could have had no tendency to 
make happen what he predicted would 
happen. The facts, “objectively con- 
strued,” determine whether his hy- 
potheses are true or false. 

Compare this with the girl who says 
“I should marry a man having such 
characteristics.” She is not predicting 
what will happen, but is expressing her 
intention to marry that kind of man. 
She has decided that this is the “right” 
sort of man for her. Her decision is 
not an inference based on laws describ- 
ing what in fact happens; it is a pre- 
scription for what she should do based 
on rules she follows in deciding what 
to do. These rules, unlike laws (hy- 
potheses), guide and direct her activity 
by evaluating certain behaviors as the 
right (appropriate) things to do. 
(E.g., “I should be a dominating per- 
son, like father, not one who lives in 
fear.”) She has decided to follow 
this rule and she could disobey it. 
In following the rule she makes things 
happen—i.e., she turns down this suitor 
and that because he does not have the 
requisite characteristics. The rule she 
follows thus tends to bring about what 
it prescribes; indeed, that is its func- 
tion. What in fact happens cannot 
show the truth or falsity of the rules 
she follows. For rules prescribing 
what should be done cannot (logically) 
be confirmed or disconfirmed by the 
facts; unlike hypotheses which describe 
what is done, they cannot be true or 
false. But it can be foolish or self- 
destructive for someone to follow cer- 
tain rules. The trouble with the old 
maid who never finds a husband is not 
that she lacks skill in using hypotheses 
to make predictions; it is that she fol- 
lows foolish rules—the conditions she 
lays down for the right man involve 
“altogether too many dimensions” and 
so she decides to turn down every 
suitor. 

Kelly’s discussion of construing thus 
fails to distinguish between the way 
rules which prescribe behavior are used 
to decide what one should make hap- 
pen, and the way hypotheses (laws) 
which describe behavior are used to 
predict what will happen. 

All this is not to suggest that Kelly's 
approach is “unscientific,” nor to advo- 
cate that clinicians model their work on 
the theories developed by physicists. 
I will deal with metatheoretical ques- 
tions in the second part of this paper 
and will there defend the use of per- 
sonal constructs for the explanation of 
human actions. My aim here is to 
clarify the nature and function of per- 
sonal constructs. What I am arguing 
is that there are important logical dif- 
ferences between using hypotheses to 
make predictions and using rules to 
make decisions, and that construing 
resembles the latter rather than the 
former. 


RELATION OF CONSTRUCTS 
TO BEHAVIOR 


The foregoing distinctions can now 
be used to specify the way constructs 
channelize behavior. To use con- 
structs is, first of all, to follow rules 
in classifying people. If I construe in 
terms of fear versus domination, I have 
decided that people should be classified 
as “similar” or “different” on the basis 
of whether they dominate others or 
live in fear of others. The fear-domi- 
nation construct is not a hypothesis I 
use to predict that the people I meet 
will be either dominating or fearful 
types; I perceive the people I meet as 
dominating or fearful types because I 
classify them on the basis of the fear 
versus domination construct. 

Finding that, e.g., my boss exhibits 
certain behaviors which fall under the 
domination pole of my construct, I 
classify him as a person who tries to 
dominate others. Since to classify 
something as an X is to expect it to 
behave in X-ish ways, this construal of 


the boss will lead me to expect domi- 
nating behavior from him. But the sub- 
jective anticipations concerning the 
future behavior of another which fol- 
low from my construal (classification) 
of him cannot, of themselves, chan- 
nelize my behavior. For they provide 
me with no basis for action unless I 
also evaluate what I anticipate. Con- 
structs provide such evaluation be- 
cause, as Kelly (1955) puts it: 
A person builds his life upon one or the 
other of the alternatives represented in each 
of his dichotomies. This is to say that he 
places relative values upon the ends of his 
dichotomies. Some of the values are quite 
transient . . . others are quite stable and 
represent guiding principles. Even the stable 
ones are not necessarily highly intellectu- 
alized—they may appear, rather, as appetitive 
preferences [p. 65]. 


[...] the other, the construct functions as a 
rule in terms of which I can decide 
what I should do about someone whom 
I have construed this way. The con- 
struct “governs [p. 132]” my behavior 
because it is a rule I use in deciding 
what the right thing for me to do is. 


[...] is the one in terms of which I must 
act: Either I allow the boss to domi- 
nate me and live in fear of him, or I 
try to dominate him and make him 
fear me. If I value the domination 
pole, then the latter will be the right 
course for me to follow. On the basis 
of such a “calculation,” I may decide 
to insult the boss (see the Choice 
Corollary, Kelly, 1955, p. 64; also pp. 
487-488, 497-498, 521-526). If asked 
why I insulted the boss I will bring 
up my anticipations about what would 
happen if I did, or did not, insult him. 
These anticipations are my reason for 
insulting him. What makes them a 
reason to act this way rather than 
another is a rule I follow in deciding 
what the right thing to do is. It is 
this rule which “connects” the subjec- 
tive anticipations following the “be- 
cause” in “I will insult him because 
. . .” with my decision to insult him. 

My behavior towards the boss has 
been psychologically channelized by the 
fear versus domination construct be- 
cause my decision to construe him as 
dominating and the subjective antici- 
pations which followed from it, as well 
as my decision to insult him in order 
to dominate him instead of having to 
fear him, have been guided by the 
construct—the construct is the rule I 
followed in making these decisions. 
But the discussion has been oversim- 
plified since actual decisions are likely 
to be determined by the intersection of 
a number of construct dimensions 
rather than by a single construct. 
Since constructs are arranged in a 
system of “ordinal relationships” (some 
being like “fundamental princi- 
ples” while others are “subordinate 
[pp. 56, 61, 78]”), the system of per- 
sonal constructs is the hierarchical sys- 
tem of rules a person uses to guide 
and direct his actions. That is why 
personal constructs channelize a man’s 
actions and can be used to explain 
them. 


I now turn to metatheoretical con- 
siderations. Just how do personal con- 
structs, understood in light of the fore- 
going discussion, serve to explain and 
predict an individual's behavior? Are 
there significant logical differences be- 
tween such explanations and predic- 
tions on the one hand, and those given 
by S-R, or need or drive theorists, 
on the other? 

As a point of departure, consider 
Meehl’s (1954) “Logical Reconstruc- 
tion of Clinical Activity” (Ch. 6). 
Meehl argues that: 
every skilled clinician must be making use 
of some laws, however vague, which may be 
of considerable generality, but which never- 
theless make it possible for him to order 
his material with respect to a given patient 
in terms of some general nomothetic basic 
psychodynamics [p. 53]. 


In “forming the concept of this person” 
the clinician is “assessing his needs and 
his modes of satisfying those needs,” 
and this means that he has to 

utilize the given facts, together with crudely 
formulated laws, to invent a hypothesis con- 
cerning the state of certain intervening vari- 
ables or hypothetical constructs in his patient 
[p. 46]. 


The law the clinician uses is nomothetic 
in its form, but “in its end terms, i.e., 
in the defining properties . . . that 
specify ‘S’, ‘R’, ‘G’, ‘D’, etc., strongly 
idiographic [p. 64; also pp. 42-44, 
60].” It is very difficult to specify 
the general defining properties of these 
classes “because in the case of human 
social behavior the defining properties 
are, in general, not physicalistic [p. 
54].” Consequently, it is only in light 
of a hypothesis about the particular 
patient that one can see that a certain 
behavior datum belongs to, e.g., a cer- 
tain S or R class, and thus can apply 
the general law to the particular case 
(pp. 52, 54-55, 82). Moreover, 
though this hypothesis about the state 
of the patient must, in conjunction with 
the law, logically entail the data, it 
cannot be logically deduced from the 
data (p. 57). The creation of such 
hypotheses about the states of variables 
in the particular patient is, according 
to Meehl, the unique contribution of 
the clinician; it is the reason why 
he cannot be replaced by a Hollerith 
machine (pp. 49-50, 65-66, 78). 

In what follows I will argue that 
when a clinician explains and predicts 
his patient’s behavior in terms of per- 
sonal constructs, what he is doing re- 
quires a “logical reconstruction” that 
differs sharply from the one offered by 
Meehl. 


CLINICAL EXPLANATIONS 


What a clinician like Kelly does in 
“forming the concept of this person” 
cannot be understood as “assessing his 
needs and his modes of satisfying those 
needs.” For Kelly (1955) explicitly 
rejects all “push theories based on 
‘stimuli’ and pull theories based on 
‘needs’ [p. 36, his italics; also 1960, 
pp. 49-50; 1962, pp. 84-86].” Against 
such theories Kelly (1960) insists that 
the clinician must “not envision the be- 
havior of man in terms of the external 
forces bearing upon him [p. 60]”; he 
“must not construe his client as an 
unthinking object; he must construe 
his client’s construings. This is the 
central theme of the psychology of per- 
sonal constructs [1955, p. 531].” And 
this implies that there are important 
logical differences between explana- 
tions in terms of personal constructs 
and the type of explanation recon- 
structed by Meehl. 

1. For Kelly the clinician must ac- 
count “for the alternative a man selects 
in a choice situation [p. 45]” in terms 
of the way that man himself construes 
his situation. Where Meehl sees the 
clinician as hypothesizing about the 
states of “process variables” (McClel- 
land, 1955) in order to apply nomo- 
thetic laws to the behavior of the pa- 
tient, Kelly sees the clinician as eliciting 
the content of the agent’s personal con- 
structs—the rules in terms of which 
this person defined what alternatives 
he had to choose between and what the 
right thing to choose is. So while 
Meehl’s account takes the point of 
view of an external spectator of peo- 
ple’s behavior and emphasizes “psy- 
chological process” (i.e., the applica- 
tion of general principles about how 
people satisfy needs to the particular 
case), personal construct psychology 
takes the point of view of an agent 
who makes choices and emphasizes 
“psychological content” (i.e., what con- 
structs on the part of this person lead 
him to make such choices). 

2. On Meehl’s (1954) reconstruc- 
tion, the clinician explains behavior by 
showing that it is predictable (i.e., 
entailed by laws in conjunction with 
an hypothesis about the state of the 
patient) and thus not surprising. Clin- 
ical explanation is clinical pre- or post- 
diction (p. 123). In contrast to this, 
the personal construct clinician uses 
the content of the patient’s constructs 
to show that because the patient fol- 
lowed these rules he chose this as the 
right thing to do. Instead of showing 
how the behavior to be explained can 
be subsumed under general laws, the 
personal construct clinician shows that 
if one followed these rules (i.e., used 
these constructs) then this would seem 
the right thing to do. In one case we 
understand behavior by seeing its oc- 
currence as predictable; in the other, 
by seeing its choice as intelligible. 

3. Since explanations in terms of 
personal constructs are given from the 
agent’s point of view, they must in- 
volve the sort of concepts the agent 
himself understands and uses in decid- 
ing what to do. Fear versus domina- 
tion, gentle versus aggressive, etc., are 
considerations of the kind an agent 
uses in deciding what to do. This dif- 
fers sharply from Meehl’s clinician 
whose explanatory concepts—e.g., drive 
states, needs, reaction formations, etc.
—are not the sort an agent uses to 
explain his own behavior. Whether 
the person is capable of understanding 
Meehl's kind of explanation is irrele- 
vant to its correctness; but since per- 
sonal constructs explain from the 
agent’s point of view, the explanation 
cannot be correct if it involves con-
cepts which the agent cannot grasp and
thus cannot have used in deciding what
to do. This does not mean that what-
ever explanation the agent gives for his
own behavior is correct, or even that 
he can verbalize the constructs (rules) 
on which his decision was based. For 
the agent may not be aware of the 
content of some of his constructs, or 
of the way he uses them, because they 
are at a “low level of cognitive aware-
ness”; i.e., some of the constructs in-
volved may be “preverbal,” or one end
of a construct may be “submerged,” 
or a construct may have been “sus- 
pended” (Kelly, 1955, pp. 459-476; 
also Mischel, 1963). In a clinical set- 
ting the patient, typically, is unable to 
tell correctly, or at all, how he really 
construes and it is the clinician's job 
to discover the rules on which he bases 
his choices. Nevertheless, the con- 
structs used to explain behavior must 
be such that in principle the agent 
could use them to make his choices. 

Explaining behavior in terms of per- 
sonal constructs thus differs from ex- 
plaining the behavior of an “unthink- 
ing object” in terms of “pushes or 
pulls”; to give such explanations the 
clinician must try to “understand the 
client by putting himself in the client’s 
shoes and attempting to stand and walk 
like the client [1955, p. 333].”


JUSTIFICATION OF CLINICAL 
EXPLANATIONS 


Many psychologists see taking the 
agent's point of view as a way of dis- 
covering a hypothesis that may explain 
the patient’s behavior; but since the 
intuitive plausibility of a hypothesis is 
no guarantee of its truth, they often 
insist that the only way it can be 
justified is by showing that in con- 
junction with general (nomothetic) 
principles it entails such behavior. 
Thus Meehl (1954), although he is 
“impressed” by the “vacuous character 
[of general principles] insofar as gen-
erating the particular hypothesis is
concerned [p. 50],” insists that a hy- 
pothesis about the patient can be justi- 
fied only by showing that in conjunc- 
tion with general principles it entails 
his behavior—this is the only “scien-
tific,” or even “rational,” support that
can be given for it (pp. 57, 66, 72-73, 
81). Similarly, Falk (1956) holds 
that we can “turn to idiographic pro- 
cedures for the provision of working 
hypotheses, not for their confirmation 
[p. 61].”

Yet when we turn from theoretical 
discussions to clinical practice there is 
something oddly irrelevant about the 
nomothetic principles which, allegedly, 
are needed to justify clinical explana- 
tions. Consider Meehl’s (1954) ex- 
planation of a patient’s dream in which 
she feels resentful towards someone 
dressed in a riding habit. He inter- 
prets the dream as involving a sexual 
pun on the phrase “riding habit” so 
that the person in the dream is her boy 
friend with whom she does not want 
to establish the habit of having sexual 
relations (p. 71). Is it really the
use of such general principles as these:

in dreams abstract notions are often repre-
sented by concrete forms and processes and
in a minority of cases the mode of plastic
representation involves a pun [p. 72]


which justify the clinician’s hypothe- 
sis? Meehl himself says that “the re- 
mainder of the dream and her asso- 
ciations to it... confirmed this 
hypothesis [p. 71]”! What actually 
justifies the clinician’s hypothesis seems 
to be the way his explanation “fits” 
(makes intelligible) other things the 
patient says and does; the general prin- 
ciples seem “vacuous” not only with 
respect to the discovery, but also with 
respect to the justification of such clini- 
cal hypotheses. (Similar considera- 
tions apply to Meehl’s other illustra- 
tions; vide pp. 48-50.) 

What leads writers like Meehl to 
insist that general laws (“however
vague”) must be used to justify clinical
explanations is not the facts of clinical 
practice—the very “vague” and “vacu- 
ous” character of these laws indicates 
that they can be of no use to the prac-
ticing clinician. What really lies be-
hind this insistence is a theoretical
point about the logic of explanation.
The model of explanation which most 
psychologists have in mind derives 
from the physical sciences, and in those 
sciences the “ordering” of the data, 
the “connection” between the fact being 
explained (e.g., the time of an eclipse) 
and the facts offered in explanation 
(e.g., planetary positions, dimensions,
etc.) is based on general principles. 
Since the event to be explained and 
the events offered in explanation are
separate and distinct, they can be con- 
nected, or ordered, only by means of 
general principles asserting a relation 
between such events. Now if we 
view a person’s acts from the stand-
point of an external spectator, then
they will be seen as “behavior data,”
i.e., happenings to be explained on the
basis of the foregoing model. The
sexual pun in the dream will then be
regarded as an “event” whose occur-
rence can only be explained by con-
necting it with other events. Since
this can only be done by means of
general principles, a psychologist who
construes the situation in this way will
claim that it is only in terms of (nomo-
thetic) laws that one can (logically)
“order his materials with respect to a
given patient,” or “rationally justify”
one’s explanations. 
But the situation is radically dif-
ferent when we explain behavior from
the agent’s point of view, i.e., in terms 
of the way he construes. For the sex- 
ual pun in the dream will then be 
seen not merely as a happening but as
an action—in dreaming this the patient
is expressing her intention not to es-
tablish habitual sexual relations with
her boy friend. Why she has that 
intention can be explained (i.e., con- 
nected with the rest of her behavior) 
by showing how the way she construes 
would make this the wrong thing for 
her to do. Why she expresses this 
intention in a dream, rather than 
overtly, can be explained by showing 
how the way she construes would make 
an overt expression of hostility 
wrong. Since constructs are the rules 
a person follows in deciding what to 
do, they are themselves the ordering 
principles which connect the action 
with the construal that explains it. 
Personal constructs explain behavior, 
not by showing that such behavior 
data are deducible from these con- 
structs in conjunction with general 
principles, but by showing that such 
actions are intelligible because this 
would seem the right thing to do if 
one construed this way. Since such 
explanations are based on the content 
of the rules an agent follows in making 
choices, rather than on general laws 
connecting various kinds of happen- 
ings, they stand in no need of nomo- 
thetic laws for their justification. (For 
a fuller discussion of this issue, see 
Mischel, 1963). 

But this does not mean that giving 
such explanations is an intuitive, non- 
empirical procedure. The clinician 
must discover how his client construes 
from the facts and he must justify his
explanation in terms of the facts, i.e.,
what this person says and does, or
would say and do under certain cir- 
cumstances. Having discovered the 
content of a person’s constructs from 
what he says and does, the clinician 
explains his behavior by showing that 
the way he in fact acts in various sit- 
uations is just what would seem the 
right way to act if one followed the 
rules he follows. If the person says 
or does something that cannot be ex-
plained this way—i.e., the constructs
he is taken to be using do not lead to 
a calculation that would make this 
seem right under such circumstances 
—the explanation has been falsified by 
the facts. 

Since people in their words and
deeds not only deceive others, but
often deceive themselves about the rea- 
sons for their choices, it may be very 
difficult to discover just how a person 
really construes. To discover this is 
the empirical task of the clinician and 
the Psychology of Personal Constructs 
is addressed to that task. Kelly (1955) 
tells the clinician:
If you do not know what is wrong with a 
person, ask him; he may tell you. The 
clinician who asks such a question will 
have to be prepared to do some careful list- 
ening [pp. 322-323]. 

Kelly’s whole theory, the various 
“corollaries” as well as related tech- 
niques like the REP test and the 
analyses of self-characterizations, may 
be regarded as a guide to such “care- 
ful listening”; it shows the clinician 
what to look for in order to discover 
“the full import of his client’s thinking 
[pp. 71-72]”—i.e., why he construes 
what he does as the right thing to do. 
The clinician who discovers this can 
help his client because “from the stand- 
point of the psychology of personal 
constructs, psychological disorders can 
be traced to characteristics of a per- 
son’s construction system [p. 832].” 
Thus for Kelly a psychological disorder 
“is not explained simply as the inevita-
ble outcome of any past which happens 
to be unfortunate. It relates to one’s 
present stance as he faces the future 
[p. 835].” Anxiety, guilt, hostility, 
aggressiveness, and various personality 
disorders are interpreted by Kelly as 
a function of the way a person con- 
strues, the way he uses self-defeating 
and deceiving rules in trying to cope 
with life (Ch. 16 & 17). Psychological 
illness thus becomes something that
differs in kind from physical illness— 
not so much something that “happens” 
to a person as a causal consequence of 
“external forces bearing upon him,” as 
something he himself does by the way 
he construes. And psychotherapy be- 
comes helping a person to “reconstrue.” 


EXPLANATION VERSUS PREDICTION 


But what about predictions? Kelly 
(1955) rightly points out that when 
the clinician understands the way peo- 
ple construe he has “grounds upon 
which [he] can make predictions re- 
garding how people will act [p. 67].” 
If we understand the way someone 
construes then we can predict, e.g., 
that he is likely to classify another per- 
son as being of Type X and so to 
anticipate X-ish behavior from him; 
and that, since he construes Y as the 
right thing to do about X-ish be- 
havior, he is likely to relate himself 
to this person by doing Y. So if 
someone follows certain rules then we 
can expect certain regularities in his 
behavior and thus make predictions. 
Yet the predictions which we base on 
the rules a person is taken to follow 
in deciding what to do differ from those 
based on nomothetic laws. 

To see the difference, note that when 
laws are used to explain physical phe- 
nomena the criteria for deciding 
whether or not various phenomena are 
“similar,” and thus capable of sub- 
sumption under the same nomothetic 
law, are those of the observer—what 
makes falling apples and the motions 
of the planets similar phenomena when 
we are doing dynamics is Newton’s 
theory of motion. Since the observer’s 
theory specifies the phenomena to 
which it is applicable, predictive failure 
shows that the theory on which the 
prediction was based must be faulty. 

This is also true when we are dealing 
with animal behavior because we set
up the defining conditions for, e.g., a 
certain S or R class (Meehl, 1954, p. 
42). But the situation changes when 
we explain human behavior in terms 
of the rules a person follows in decid- 
ing what to do. For then the criteria 
of similarity are not those of the ob- 
server but those of the agent; it is not 
what we regard as, e.g., “dominating” 
behavior, but what he regards as such 
behavior that determines how he will 
act. Since understanding how he con-
strues is understanding the rules he
follows in classifying people and decid-
ing what to do about them, predictive
failure will, of course, make us wonder 
whether he really construes the way 
we thought. But it will not compel us 
to withdraw the claim that this is how 
he construes. For it may be that, 
though he usually construes the way 
we think (i.e., classifies people on the 
basis of these rules, etc.), he had some 
reason—to be explicated in terms of 
the rest of his personal constructs— 
for construing differently in the situa- 
tion where our prediction failed. Be- 
cause he is the one that must apply the 
rule that determines “similarity,” it is 
always possible that he had some rea- 
son for not seeing the situation as simi- 
lar to others in the way we predicted 
he would. Since the prediction was 
not based on a nomothetic law whose 
“end terms” can be defined by the ob- 
server’s theory without reference to the 
way that person himself construes, pre- 
dictive failure will not show that there 
must be something wrong with the 
“theory” on which the prediction was 
based. 

Consider an example: Meehl (1954) 
argues that the only way we can 
“prove” that a certain “way of talking”
on the part of a patient is associated
with hostility towards his sister is by
carrying out a “statistical analysis on
the verbal protocols of the individual
case [p. 81].” But what constitutes a
similar way of talking when exactly 
the same words said in a different con- 
text, or tone of voice, may be a differ- 
ent way of talking? The clinician will 
have to specify criteria for identifying 
similar ways of talking before he can 
carry out his statistical analysis. Fail- 
ure to find a significant correlation 
between what he has specified as simi- 
lar protocols and hostility may mean 
either that these protocols are not as- 
sociated with hostility, or that from the 
patient’s point of view they are not all 
similar. For it is not the clinician's 
criterion but the patient’s own con-
strual (i.e., his use of rules) which
determines whether or not what he
says on one occasion is really similar
to what he says on others. Since sta- 
tistics cannot settle this issue they can- 
not be the final court of appeal that 
settles claims about how a person really 
construes. 

This leads to my final point: The
statistical (predictive) approach and
the personal construct approach have
such different aims that they cannot be
significantly compared. If we are in-
terested in predicting behavior, then
statistics may be a more reliable guide
than personal constructs; but if we are 
interested in explaining human actions 
then personal constructs cannot be re- 
placed by statistics. For predicting 
and explaining are different, and suc- 
cess in the former is not a criterion of 
success in the latter. Careful analysis
of the way the tides behaved in the 
past enabled the ancient Babylonians 
to predict their future occurrences with 
accuracy; but they could not explain 
the occurrence of the tides. Newton’s 
theory of gravitation provided an ex-
planation, but it did not lead to better
predictions. In fact, most tidal pre-
dictions are still based on statistical
analyses of their past behavior rather
than on Newton’s theory. Though
“positivistic” thinking about science
was dominated by the idea that there 
is no essential difference between ex- 
planation and prediction, more recent 
studies of the logic and history of sci- 
ence have brought this idea under 
sharp attack even with respect to the 
physical sciences (Hanson, 1958, 1959; 
Toulmin, 1961). Surely this may en- 
courage the clinician to point out that, 
since his primary aim is to understand 
his patient with a view to helping him, 
he is doing something different from 
the actuarian, something that cannot be 
relevantly evaluated in the same terms. 

For suppose a clever statistician, 
who knows nothing of chess, were to 
observe the behavior of chess players. 
He would soon notice certain regulari- 
ties in the way pawns, etc., move; he 
would also notice regularities in the 
way a particular player moves one 
figure rather than another (e.g., when 
besides the king he has only a rook 
and a bishop, under what conditions 
does he move the former rather than 
the latter?). Given time and a good 
calculating machine, he could learn to 
predict from the configuration of pieces 
on the board what “move” this player 
is likely to make. But predictive suc- 
cess would not show that he under- 
stands the move—indeed, he does not 
even understand that it is a move in 
the game of chess! It is simply irrele- 
vant to compare what this statistician 
is doing with what is done by someone 
who wants to make sense of the chess 
player’s behavior by showing, in terms 
of the rules of chess and the strategy 
followed by this player (e.g., the kind 
of end-game he was trying), why he 
construed this as the right move to 
make. If we can explain the moves 
made by a chess player in terms of the 
rules and strategic principles he fol- 
lows, we will also be able to predict 
what move he is likely to make in 
certain circumstances. Still, what we 
are doing differs from what is done by
the statistician interested in predicting 
behavior. 

Philosophers influenced by Wittgen- 
stein’s (1953) discussion of rules have 
recently argued that since the social 
and historical sciences deal with rule- 
following human behavior they require 
a model of explanation which differs 
from that of the physical sciences 
(Dray, 1957; Winch, 1958). I have 
tried to develop a similar argument for 
clinical explanations based on personal 
constructs. Nothing I have said is 
intended to detract from the impor- 
tance of nomothetic approaches to the 
study of behavior. But my argument 
is a defense of an idiographic approach 
to clinical activity in that it has tried 
to show that the use of personal con- 
structs to explain individual behavior 
is different from, and independent of, 
the use of general laws.


THE ORIGIN OF SELF-CRITICISM 1


The learning of self-criticism is represented as the prototype of a form of
internalization in which the child replicates certain components of the punish- 
ments to which it has been previously exposed. The stimulus properties of 
these components, first presented in the responses of a model, attain rein- 
forcement value when they come to operate as signals for attenuation of the 
child’s anticipatory anxiety. The effects of a variety of experimental sociali- 
zation paradigms are combined with an analysis of naturalistic socialization 
to illustrate the mechanisms in the acquisition of self-criticism. Some ex- 
tinction paradigms are also used to examine the sources of its persistence. 
Both the motivating and the signalling functions of social punishment are 
described as being essential to its reproduction, while the rewarding charac- 
teristics of the model are seen to bear only a very tangential relationship to 
the phenomenon. The motivational antecedents and consequences of self- 
criticism are considered to be independent of the cognitive substance that
defines its evaluative character.


The reproduction of the punitive re- 
actions of socializing agents is a readily 
observable phenomenon in the behavior 
of young children. Children often 
show punitive responses to their own 
actions, and they sometimes recreate 
their parents’ disciplinary actions in 
exact motor detail. As language and 
cognition increasingly take the place of 
action, social punishment becomes most 
extensively and persistently reproduced 
in the form of verbal and evaluative 
responses. It is very common, for ex- 
ample, to see a young child react to 
its misdemeanors by saying aloud to 
itself: “You’re a bad boy!” or “That
was bad!” Such responses become 
more covert as the child’s symbolic 
equipment expands. But people do 
continue beyond childhood to frequently 
direct toward themselves (and toward 
others) various components of the so- 
cial punishment to which they have 
been previously exposed. These self-
punitive reactions belong to a class of
behavior whose social learning repre-
sents one important type of imitation
or identification. They are also of con-
siderable interest because they seem
rather paradoxical when approached in
terms of our conventional concepts of
motivation and reinforcement.

1 The author is indebted to two of his
colleagues, F. W. Irwin and R. L. Solomon,
for their kind readings of an earlier version
of this paper. He also wishes to thank
Selma Roseman for her capable research
assistance in collecting the experimental data.

My purpose in this paper is to ex- 
amine the origin of self-criticism as an 
illustrative case of the learning and 
motivational processes which more 
broadly underlie the adoption of a 
model’s punitive responses. The con- 
ditions of aversive learning under 
which children acquire the use of the 
label “blue” to refer to their own ac- 
tions, when the label has been repeat- 
edly used as a component of punish- 
ment by a socializing agent, will be 
demonstrated with a series of experi- 
mental socialization paradigms. Evi- 
dence bearing upon the resistance to 
extinction of the self-critical response 
will also be presented. The results of 
the paradigms will be used, together 
with an analysis of naturalistic sociali- 
zation, to advance the view that self- 
criticism is acquired because it actually 
reduces anxiety, and that it does so
precisely because evaluative labels were 
originally subsumed under or contigu- 
ous with external punishment. The 
acquisition of self-punitive responses 
in general will be described as having 
only a very limited and indirect rela- 
tionship to the rewarding characteris- 
tics of a model. Finally, it will be 
argued that self-criticism reveals the 
essential motivational antecedents and 
consequences of any internalized evalu- 
ative response to a socially defined 
transgression, regardless of the nature 
or extent of the cognitive substance of 
the response. 

The evaluative judgment of one’s 
own actions has been traditionally re- 
garded as the foundation of internalized 
conduct (Freud, 1936, Ch. 8; Hart- 
shorne & May, 1928; Piaget, 1948; 
Sears, Maccoby, & Levin, 1957, Ch. 
10). Many self-evaluative responses 
must be derived, of course, from social 
learning based on reward. But the 
tendency of children to independently 
replicate the punitive and prohibitive 
functions of socialization makes a dis- 
tinct contribution, and one that is at 
least comparable with the effects of 
social reward, to the distant relation- 
ship that human behavior often bears 
to what would seem like more inher- 
ently preferred responses. This repli- 
cative effect of social punishment is 
clearly displayed in the internalized 
dispositions which lead people to im- 
pose upon themselves both restraint and 
unpleasant consequences of their own 
actions. In many contemporary psy- 
chological conceptions of internaliza- 
tion (Miller et al., 1960, Ch. 5; Sears 
et al., 1957, Ch. 10; Whiting & Child, 
1953, Ch. 11), self-criticism is viewed 
as the mediator of other reactions to 
transgression, and also of control over 
the occurrence of transgression, when 
such responses can be elicited in the 
absence of direct external surveillance. 
It is, in fact, commonly assumed that
the internalized consequences of social 
punishment require the operation of 
explicit standards or values which have 
been adopted from those of a model. 
There is actually considerable evi- 
dence to question the assumption that 
the adoption of a model’s values or 
standards is requisite to internaliza- 
tion. The casual observation of the 
first appearance of internalization in 
preverbal children makes it particu- 
larly obvious that internalized reactions 
to socially punished acts do not neces- 
sarily imply the presence of self-critical 
cognition. Even when they do not yet 
have the equipment for social evalua- 
tion, young children frequently show 
signs of anxiety and anxiety-reducing 
maneuvers following a transgression 
that has not been apparently open to 
external observation. Some recent 
work with dogs (reported by R. L. 
Solomon in Mowrer, 1960b, pp. 399- 
404) also suggests that internalization 
does not require a highly developed 
cognitive base. Moreover, numerous 
studies have indicated that internalized 
responses to transgression show great 
variability, among both children and 
adults, in the nature and extent of the 
cognitive mediating processes on which 
they rest (Allinsmith, 1960; Aron- 
freed, 1960; Kohlberg, 1963; Whiting, 
1959). Even self-corrective responses, 
such as confession or reparation, have 
been found to occur frequently without 
evaluative standards being applied to 
a transgression (Aronfreed, 1961). 
Nevertheless, self-criticism surely de-
serves special attention among the in-
ternalized consequences of social pun-
ishment. In addition to its significance
for the phenomenon of identification, it
also provides verbal and cognitive sig-
nals through which other social re-
sponses can be mediated with a per-
sistence and generalization that would
be impossible if they were forced to
depend only on the immediate stimuli
of the transgression itself and of its 
external context. 


ACQUISITION 


Actions (and, sometimes, intentions 
or other intrinsic precursors of action) 
may be defined as transgressions to the 
extent that they have been followed, 
in the child’s experience, by any form 
of social punishment. Social punish- 
ment includes, of course, not only the 
application of concrete noxious stimuli, 
as in physical punishment, but also 
withdrawal of affection and a wide 
variety of other kinds of aversive 
events. When an action is defined as 
a transgression, through its aversive 
social consequences for the child, its 
response-produced cues become signals 
for a conditioned drive state that can 
be given the general designation of 
anxiety. The anxiety may have, how- 
ever, qualitative variations which re- 
flect its cognitive context and possibly 
the nature of the original punishment. 
It is the invariant motivational source 
of the child’s suppression of transgres- 
sion, and also of a number of different 
responses to the occurrence of trans- 
gression, all of which share in common 
their instrumental value for reducing 
the anxiety. The instrumental status 
of these responses is derived from the 
attachment of anxiety attenuation to 
their discriminant stimulus properties, 
as a result of certain significant rein- 
forcement contingencies which are 
present in the socialization process. 
The responses may be said to be in- 
ternalized to the extent that the anxiety 
which motivates them has come to be 
directly elicited by transgression and 
independent of either external punish- 
ment or social cues which portend 
punishment.2 Such instrumental re-
sponses are quite variable with respect
to the precise patterns of learning
through which they become self-rein-
forcing for the child. Recent studies
in the experimental induction of self-
criticism and reparation suggest, for
example, that these two reactions to
transgression are different in the spe-
cific contingencies required for their
acquisition and in their dependence on
evaluative cognition (Aronfreed, 1963;
Aronfreed, Cutick, & Fagen, 1963).
They cannot be regarded as alternative
expressions of a unitary evaluative and
punitive agency such as “conscience”
or “superego.”

2 In defining internalization, one might
make a more subtle distinction between
whether the anxiety occurs merely in the
absence of punishment or whether it occurs
in the absence of other external aspects of
the original punishment situation (such as
perceived observation by others). While the
latter condition is a more stringent criterion
of internalization, there are significant ad-
vantages in providing a broader base of
definition. It is very difficult to know
whether a person’s perceived environment
includes the possibility of external observa-
tion. And once reactions to transgression
become independent of punishment, they may
take much the same form regardless of the
actual presence or absence of observers.

The anxiety-reducing value of some
internalized responses to transgression,
such as confession or reparation, may
be acquired because the responses
themselves originally received direct
external reinforcement through the pre-
vention or arrest of punishment (or
through the removal of social cues
which portended punishment). But
other responses can produce cues to
which anxiety reduction has become
attached even when the responses have
not occurred during the initial sociali-
zation process. The most prevalent
instances of such responses are those
in which the child reproduces compo-
nents of its previous punishments.
Under the ordinary circumstances of
child rearing, many components of rela-
tively brief social punishments are suf-
ficiently close to the anxiety termina-
tion that follows them to take on sub-
stantial power as cues for the reduction
of the anticipatory anxiety that has 
already become directly associated with 
transgression. The anxiety-reducing 
function of these components of punish- 
ment is central to an understanding of 
self-criticism. Quite aside from its 
cognitive requirements, self-criticism is 
a replication of the verbal medium of 
punishment. When a child’s punish- 
ment incorporates verbal criticism— 
the application of verbal labels or 
standards to its behavior—the criti- 
cism, like any component of punish- 
ment, may attain the significance of a 
cue signaling the attenuation of the 
antecedent anxiety that occurs in the 
interval between transgression and 
punishment. Following subsequent 
transgressions, the child itself can make 
the critical response and thus provide 
itself with anxiety-reducing cues. The 
cue aspects of the self-critical response, 
initially presented in the punitive be- 
havior of a model, thus acquire rein- 
forcement value through their conjunc- 
tion with anxiety reduction, without 
the response having to be first overtly 
emitted by the child and then exter- 
nally reinforced. A component of 
punishment becomes preferred to the 
experience of anticipatory anxiety. 
This description of the learning of 
self-criticism specifies that it is estab- 
lished as the consequence rather than 
the source of the negative affect that 


8 The anxiety-reducing cue value of pun- 
ishment may also be useful in understanding 
why some individuals rely primarily on puni- 
tive external events to define the conse- 
quences of transgression (Aronfreed, 1961; 
Kohlberg, 1963). Such individuals may be 
those who have been given sparse verbal and 
cognitive equipment or who have not found 
it reinforcing to use their own behavioral re- 
sources in response to transgression (for 
example, young children or members of 
lower status groups). 
we experience following a transgres- 
sion. The anxiety-reducing function 
of reproduced punishment is sometimes 
compellingly visible in young children 
who, even though they are unaware of 
being observed, may show evidence of 
acute discomfort when they misbehave 
and then apparent relief after they have 
criticized or punished themselves. The 
introspective perception that the self- 
critical response sometimes precedes 
unpleasant feelings may be phenomeno- 
logically misleading if it is used to 
judge the validity of the mechanism of 
acquisition set forth here. The re- 
sponse, once firmly acquired on an in- 
strumental basis, may begin to occur 
directly to the cues of transgression 
with a latency shorter than that with 
which anxiety reaches substantial in- 
tensity. Its maintenance would con- 
tinue to be dependent on anxiety re- 
duction, however, since the response 
would tend to be repeated once the 
anxiety had attained sufficient strength. 
Recent studies (Brady, Schreiner, 
Geller, & Kling, 1954; Brush, Brush, 
& Solomon, 1955) emphasize the im- 
portance of distinguishing the position 
of anxiety in the acquisition of an aver- 
sive response from its position in the 
maintenance of the response, and Solo- 
mon and Wynne (1954) have analyzed 
this problem specifically in terms of 
latencies.4 


4 The child’s repeated or sustained emis- 
sion of self-criticism following a transgres- 
sion may be attributable to the persistence of 
the anxiety response. Its extension in time 
is very probably a function of the marked 
variability in the duration of punishment 
used by socializing agents and in the tem- 
poral position of their critical labels with 
respect to onset and termination of 
transgression and punishment. It should 
also be noted that verbal criticism is often 
used as a warning signal that punishment 
will follow if an incipient transgression is 
carried out or if an ongoing transgression 
is not corrected. Consequently, the self- 
critical response, though motivated by ante- 
The learning of self-criticism em- 
braces, then, two processes: (a) the 
attachment of anxiety to the response- 
produced cues of transgression through 
their repeated association with punish- 
ment; (b) an accrual of the intrinsic 
reinforcement value of the self-critical 
response through the inclusion of its 
cue properties within external punish- 
ment in such a way as to make them 
contiguous with the attenuation of anx- 
iety. The use of two distinct proc- 
esses to account for the establishment 
of aversive responses is not uncommon 
(Mowrer, 1960a, Ch. 3-7; Solomon 
& Brush, 1956). But the second proc- 
ess described here has certain features 
which are not ordinarily found in the 
experimental literature on aversive 
learning. The learning of an aversive 
response is typically effected by having 
the response first occur and then re- 
ceive direct external reinforcement 
through avoidance of, or escape from, 
punishment. In contrast, self-criticism 
is representative of a class of responses 
whose acquisition of functional value 
does not require either that they occur 
overtly or that they have discriminable 
effects upon external reinforcing events. 
Instead, their value is accumulated as 
their cue properties are repeatedly pre- 
sented independently, through the ex- 
ternal social behavior of a model, in a 
temporal position that permits them 
to serve as signals for the reduction of 
an anticipatory state of anxiety. The 
child’s subsequent emission of these 
responses therefore becomes immedi- 
ately self-reinforcing upon their first 
occurrence. Support for such a con- 
ception is found in demonstrations that 
neutral stimuli may acquire either mo- 
tivating or reinforcing properties, ac- 
cording to their temporal association 


cedent anxiety, may not be sufficiently anx- 
iety-reducing unless it is accompanied by 
inhibition or corrections of deviance. 
with aversive states, while relevant in- 
strumental responses are not being 
made by the organism (Gleitman, 1955; 
Montgomery & Galton, 1955; Solomon 
& Turner, 1962). 

There are some convincing indica- 
tions that direct instrumental learning 
can bring an organism to make re- 
sponses which correspond to those of 
a model (Church, 1957; Darby & Rio- 
pelle, 1959; Miller & Dollard, 1941). 
And Hill (1960) has recently spe- 
cifically attempted to describe the learn- 
ing of self-criticism by using punish- 
ment avoidance or escape as the 
external reinforcement of an initially 
overt response. But one would expect 
behavior that is established through 
direct social reinforcement, even when 
it happens to be similar to the behavior 
of a model, to have the kind of latitude 
usually permitted to responses whose 
instrumental consequences are origi- 
nally externally defined. Social rein- 
forcement is not ordinarily so selective 
as to be contingent on an extremely 
restricted form of response. Self- 
criticism, however, has the earmarks 
of the truly imitative response. It re- 
produces fairly precisely the stimulus 
properties of a model, and thus allows 
a strong presumption that its reinforce- 
ment is rather narrowly bound to those 
properties. 

It also seems implausible for other 
reasons that direct external reinforce- 
ment makes any substantial contribu- 
tion to the initial emergence of self- 
criticism. Of course, the overt ver- 
balization of the response by young 
children, once it has been established, 
might result in the withholding of pun- 
ishment or even in reward by their 
parents. But socializing agents can 
hardly consistently reinforce a response 
that is generally not apparent in be- 
havior (indeed, they must often be 
administering punishment following 
self-criticism, whether it be implicit or 
overt). In view of the very limited 
opportunity for such a contingency that 
one sees in naturalistic child rearing, it 
would certainly seem a cumbersome 
and lengthy process to externally rein- 
force a tendency in the child to recon- 
struct a component of its previous pun- 
ishment in order to avoid the delivery 
of punishment that is externally immi- 
nent. 

Anyone who has observed the vigor 
and suddenness with which self-criti- 
cism characteristically becomes observ- 
able in young children cannot escape 
the inference that the response already 
has considerable strength at its first 
verbalization. This prepotency makes 
it necessary to go beyond the assertion 
that the stimulus properties of the re- 
sponse attain anxiety-reducing value 
without its overt occurrence. We must 
also assume that the critical label is 
implicitly emitted by the child during 
the original punishment learning. The 
response would then, before it became 
overt, have already been strengthened 
through the reinforcement value grad- 
ually attached to its intrinsic cue as- 
pects. Given the cognitive capacity of 
children, the presence of the label in 
their response repertoire, and the possi- 
bly close connection between the recep- 
tive and productive functions of lan- 
guage, we might well expect their 
covert repetition of any label used by 
a socializing agent. 

Finally, it should be noted that this 
conception of the learning of self-criti- 
cism touches upon an issue of more 
general significance. There are cur- 
rently a number of inconsistent experi- 
mental findings on the question of 
whether stimuli can become positive 
reinforcers through their association 
with the termination of aversive states. 
The phenomenon has apparently been 
observed in some animal studies, under 
the appropriate conditions (Buchanan, 
1958; Goodson & Brownstein, 1955; 
Montgomery & Galton, 1955), but 
other studies indicate primarily the dif- 
ficulty of its demonstration in the lab- 
oratory (Littman & Wade, 1955; 
Mowrer & Aiken, 1954; Nefzger, 
1957). Some of these experiments, 
however, do not really duplicate the 
conditions of social learning under 
which we could expect to observe such 
an effect. As Beck (1961) has re- 
cently pointed out, attempts to test the 
reinforcing properties of stimuli which 
have been contiguous with termination 
of anxiety or punishment may not be 
successful if they are carried out in 
situations in which neither anxiety nor 
pain is present. The acquisition of 
self-punitive responses does depend, of 
course, on the presence of anxiety dur- 
ing both socialization and subsequent 
occasions when the child itself can re- 
produce the components of punishment. 


Naturalistic and Experimental Socialization 


The view that self-criticism is ac- 
quired because it attenuates an aversive 
state does not require that the external 
criticism of socializing agents occur at 
a particular point in the course of pun- 
ishment. What is crucial to the child’s 
reproduction of punishment is the tem- 
poral position of its components with 
respect to onset and termination of 
anxiety, and not their position with 
respect to onset and termination of the 
punishment itself. The usual duration 
and intensity of social punishment is 
such that even a component at its onset 
might acquire some anxiety-reducing 
significance, provided only that there 
is sufficient time after the transgression 
for the arousal of anxiety. When chil- 
dren are quite young and under close 
parental control, punishment often fol- 
lows fairly quickly upon transgression. 
But it does not follow so quickly as to 
prevent an interval of anticipatory 
anxiety, particularly since the anxiety 
comes to be elicited by the incipient 
precursors of transgression and not 
merely by its commitment. And, as 
children become older and more mo- 
bile, the interval between transgression 
and punishment tends to lengthen. 
Consequently, despite the great varia- 
bility of parents in their timing and 
ordering of components of punishment, 
any component could assume some 
anxiety-reducing value. 

To the extent, then, that parental 
punishment induces internalized anx- 
iety and is administered in a verbal 
and cognitive medium, some acquisition 
of self-criticism by the child would 
seem inevitable. A close examination 
of the learning of self-criticism there- 
fore requires artificial conditions under 
which the verbal components of social 
punishment are varied in their anxiety- 
reducing significance. The critical la- 
bels would presumably not become 
anxiety-reducing, for example, if they 
were to follow immediately upon the 
intrinsic cues of transgression (so that 
anticipatory anxiety would be negligi- 
ble) and were consistently the first 
element in a punitive reaction of some 
duration. Conversely, one would ex- 
pect the labels to have maximal anx- 
iety-reducing value if they were al- 
ways to occur at the termination of 
punishment. The acquisition of self- 
criticism under experimental conditions 
might also be facilitated if the onset of 
punishment (but not the critical labels) 
were close to the completion of trans- 
gression. Such a temporal contingency 
probably occurs quite often in the early 
phases of socialization, when the young 
child’s behavioral limitations make it 
subject to fairly immediate punishment 
following forbidden behavior, and it 
may make a potent contribution to the 
anxiety that motivates self-punitive re- 
sponses. Animal studies (Davitz, Ma- 
son, Mowrer, & Viek, 1957; Kamin, 
1959) have indicated that short inter- 
vals between response-produced cues 
and the occurrence of punishment fa- 
cilitate the acquisition of anxiety-reduc- 
ing responses. These temporal varia- 
tions do not occur with any consistency, 
of course, in the behavior of parents. 
But they can be brought under control 
in experimental situations. 

An experimental induction of self- 
criticism ought further to be enhanced 
if the model's critical labels coincide 
with the termination of the response- 
produced cues associated with the 
transgression. In a naturalistic set- 
ting, parental punishment and criticism 
often interfere with an ongoing trans- 
gression, and thus eliminate its intrinsic 
cues and the anticipatory anxiety al- 
ready attached to them. The child’s 
reproduction of cue components of 
punishment might well be supported by 
this effect, since responses have been 
shown to have an advantage in learn- 
ing when their stimulus properties co- 
incide with the termination of a condi- 
tioned aversive stimulus (Brush et al., 
1955; Church, Brush, & Solomon, 
1956; Mowrer & Lamoreaux, 1942). 
A rapid acquisition of value by the 
critical labels of a socializing agent, 
when they coincide with termination of 
transgression, might also reflect the ab- 
sence of any delay of the reinforcement 
represented in anxiety reduction. When 
the anxiety-reducing significance of a 
component of punishment becomes dis- 
criminable to a child, there is no delay 
between the parent’s use of that com- 
ponent and the occurrence of anxiety 
reduction. Neutral stimuli have been 
shown to attain more powerful sec- 
ondary reinforcing properties when 
there is a short interval between their 
presentation and the original reinforce- 
ment (Jenkins, 1950; Schoenfeld, An- 
tonitis, & Bersh, 1950). 

There is one other feature of nat- 
uralistic socialization whose effect upon 
the internalized consequences of pun- 
ishment is difficult to assess, because 
it is relatively restricted in variation 
and inseparable from the total parent- 
child relationship. It is the parental 
nurturance and affection that serves as 
the background for punishment learn- 
ing. Many psychologists have felt that 
the gratifications and rewards provided 
by parents are an essential ingredient 
of the child's adoption of their behavior 
and values in any form. Freud (1936, 
Ch. 8), for example, regarded the mo- 
tivation to reproduce a loving and re- 
warding parental model as one of the 
major sources of identification. And 
other theorists (Bronfenbrenner, 1960; 
Miller et al., 1960, Ch. 5; Sears et al., 
1957, Ch. 10; Whiting & Child, 1953, 
Ch. 11) have attempted to extend this 
motivation to the internalization of so- 
cial punishment. 

The most detailed rationale for the 
generalized effects of nurturance upon 
internalization is found in the work of 
Sears et al. (1957, Ch. 10) and of 
Whiting and Child (1953, Ch. 11). 
The child is presumed to be broadly 
motivated to identify with its parents 
in order to reproduce the pleasurable 
experience associated with the stimulus 
aspects of their behavior. In making 
responses like those of the parents, 
particularly when they are absent or 
have withdrawn affection, the child 
provides itself with stimuli which have 
become powerful secondary reinforcers 
through their conjunction with parental 
rewards and gratifications. Thus, the 
child’s application of the parents’ criti- 
cal evaluative responses to its own ac- 
tions represents, according to this view, 
an instance of a more general tendency 
to reproduce many kinds of parental 
behavior. And when the child criti- 
cally evaluates its own behavior, it ex- 
periences unpleasant feelings, just as 
when it was previously exposed to 
external criticism; that is, self-criticism 
is regarded as the original internalized 
source, rather than the consequence, of 
the anxiety that follows a transgression. 

The use of experimental paradigms 
affords an opportunity to compare the 
contribution of a socializing agent’s 
nurturance to the induction of self- 
criticism with that of the anxiety-re- 
ducing value of cue components of 
punishment. If the internalization of 
social punishment were a function of 
the model’s nurturance, or of the with- 
drawal of affection conveyed in the 
model’s punishment, one would expect 
that the child would be likely to adopt 
the criticism of an otherwise nurturant 
model in a punishment learning situa- 
tion. Its adoption of the model’s criti- 
cism would presumably reflect a more 
general motivation to reproduce the 
stimulus aspects of the model, and 
should accordingly not be contingent 
on the timing of the criticism with 
respect to onset and termination of 
anxiety. 


Four Socialization Paradigms 


In order to examine the origin of 
self-criticism at close range and under 
controlled conditions, four experimen- 
tal paradigms were devised, in each of 
which the experimenter repeatedly used 
the label “blue” to refer to a specific 
response whenever it was made by the 
child in the course of 10 training trials. 
The label “red” was used to refer to 
alternative responses. A contrivance 
made it possible to control the fre- 
quency of the blue response and to use 
an artificial external signal to indicate 
its occurrence. In three of the para- 
digms, the blue response was defined 
as a transgression by punishment that 
consisted of the experimenter’s 
disapproval and removal of a [...] 
that had been given to the child. The 
punishment and the external signal 
were given a common onset and ter- 
mination. In the fourth paradigm, the 
blue response was not associated with 
punishment. None of the training par- 
adigms offered the child an opportunity 
to overtly verbalize the evaluative labels 
used by the experimenter. However, 
all of the paradigms were uniformly 
followed by three test trials designed 
to observe whether the child would use 
the labels to refer to its own actions 
when the socializing agent's role was 
temporarily eliminated. 

The external signal and the punitive 
reaction of the experimenter were in- 
itiated, during the three punishment 
paradigms, immediately upon occur- 
rence of the blue response. Since the 
punishment had a number of compo- 
nents and some extension in time, the 
label “blue” could be placed in different 
positions so as to produce extreme 
variations in its timing with respect to 
onset and termination of both the pun- 
ishment and the external signal of the 
forbidden response. In the first pun- 
ishment paradigm, the experimenter 
placed the label at termination of sig- 
nal and punishment, the position at 
which it was expected to assume maxi- 
mal anxiety-reducing value. In a sec- 
ond paradigm, the label was used at 
the very onset of signal and punish- 
ment, where there was virtually no 
time for anticipatory anxiety to de- 
velop. The experimenter’s general 
(nonpunitive) demeanor, in both of 
these first two paradigms, was neutral 
and somewhat distant. 

In the third punishment paradigm, 
the experimenter displayed consider- 
able warmth and approval toward the 
child, except when administering pun- 
ishment, and expressed both physical 
and verbal affection. This procedure 
was designed to create a nurturant 
context for socialization and an ex- 
perience of nurturance withdrawal 
when the child was punished. The 
label “blue” was placed at the onset 
of punishment and signal, so that the 
effects of the nurturance itself could 
be compared with the effects, in the 
first paradigm, of maximizing the 
anxiety-reducing value of the label. 
The fourth paradigm was devised as 
a control to support the interpretation 
of the role of punishment in establish- 
ing the anxiety-reducing function of 
the label. No punishment was used, 
but the label occurred at the same 
point, the termination of the external 
response-signal, as it did in the first 
paradigm. 
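
The four paradigms just described differ on three design factors: where the label "blue" falls, whether the blue response is punished, and whether the experimenter is nurturant. The following Python sketch is only an illustrative summary of that design; the class name, field names, and the helper `contrast` are assumptions introduced here and are not part of the study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Paradigm:
    """One socialization paradigm (field names are illustrative assumptions)."""
    name: str
    label_position: str  # position of "blue": "onset" or "termination" of the signal
    punishment: bool     # whether the blue response was punished
    nurturant: bool      # whether the experimenter was warm and approving


PARADIGMS = [
    Paradigm("Labeling at Termination of Punishment", "termination", True, False),
    Paradigm("Labeling at Onset of Punishment", "onset", True, False),
    Paradigm("Nurturance", "onset", True, True),
    Paradigm("Control", "termination", False, False),
]


def contrast(a: Paradigm, b: Paradigm) -> list[str]:
    """Return the design factors on which two paradigms differ."""
    factors = ("label_position", "punishment", "nurturant")
    return [f for f in factors if getattr(a, f) != getattr(b, f)]
```

For example, `contrast(PARADIGMS[0], PARADIGMS[3])` returns only `["punishment"]`, which reflects the role of the fourth paradigm as a control for the first.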

The basic design of these paradigms, 
and some of their other operational 
features, were arrived at on the basis 
of three central tactical considerations 
which emerged from preliminary 
studies: 

1. Parents ordinarily get a number 
of “trials” in which to induce an in- 
ternalized reaction to transgression, be- 
cause young children commit the 
transgression on many occasions before 
internal controls are effective. But if 
the already socialized children who 
must be used in an experiment had 
their own intrinsic response cues to 
identify a punished act, they might in- 
hibit the act well before a new self- 
critical response had been learned. 
Furthermore, their incipient response 
cues would make it impossible to have 
an experimental condition in which 
anticipatory anxiety was negligible. 
Accordingly, the blue response was 
designed to be intrinsically indistin- 
guishable from other available re- 
sponses. Its occurrence was signaled 
by a fairly loud buzzer, whose onset 
was immediately coincident with the 
response. This signal was, then, an 
externalized response-produced cue. 

2. The punishment paradigms would 
be poor replicates of naturalistic sociali- 
zation, and might arouse irrelevant 
competing responses, if the child were 
punished for transgression on every 
trial. Children do not always make 
the socially disapproved response. 
Blue responses were therefore inter- 
spersed with red responses which were 
not punished and were actually re- 
warded in the nurturance paradigm. 
This procedure made the label “blue” 
a more discriminative stimulus, and 
also permitted a comparison of the 
child’s adoption of two different kinds 
of labels. 

3. Since self-criticism had to be ob- 
servable on the test trials, a stimulus 
was required to elicit it in a verbal 
form. Moreover, the experimenter had 
to remain in the situation, in order to 
observe the response. At the same 
time, the experimenter’s punitive pres- 
ence had to be somehow perceived as 
temporarily absent or deferred, if the 
child were to have occasion and moti- 
vation to reproduce the experimenter’s 
evaluative responses. These require- 
ments for the test trials were met by 
having the experimenter: (a) appear 
to be absorbed in other matters and 
temporarily discontinue the socializing 
role, and (b) casually provide brief 
nondirective verbal-eliciting stimuli, so 
as to make the situation appropriate 
for a wide variety of verbal responses 
from the child. 


Subjects 


The subjects for the acquisition phase of 
the study were 89 girls drawn from the 
fourth- and fifth-grade populations of two 
public schools.5 The number of subjects as- 
signed to each of the four socialization para- 
digms was as follows: Labeling at Termina- 
tion of Punishment: 27; Labeling at Onset 
of Punishment: 24; Nurturance (with label- 
ing at onset of punishment): 26; Control: 12. 
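
As a quick arithmetic check on the allocation reported above, the four group sizes can be summed and compared with the stated total of 89. This is a minimal Python sketch; the dictionary name and keys are labels chosen here for illustration.

```python
# Group sizes as reported in the text (keys are illustrative labels).
GROUP_SIZES = {
    "Labeling at Termination of Punishment": 27,
    "Labeling at Onset of Punishment": 24,
    "Nurturance": 26,
    "Control": 12,
}

REPORTED_TOTAL = 89

total = sum(GROUP_SIZES.values())
allocation_is_consistent = (total == REPORTED_TOTAL)
```

The four groups do account for all 89 subjects.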


Procedure 


Each child was taken by the experimenter 
(E), who was a female, from her classroom 
to the experimental room. The child sat in 
front of a small table, upon which was a 


5 The author is grateful to a number of 
administrators and teachers in the Philadel- 
phia school system, whose cooperation made 
this study possible. 
piece of apparatus referred to as a “ma- 
chine” and also a small box containing 
Tootsie Rolls. The machine was an 
enclosed, black, metal cabinet, roughly [...] 
inches wide, 10 inches high, and [...] 
deep. In addition to a buzzer located in 
the machine, the following items were em- 
bedded in its inclined face: an ac voltmeter, 
an electrical counter (with display), four 
small lights (two red and two green) or- 
dered vertically along the left edge, and four 
locking lever switches running horizontally 
along the bottom edge and spaced [...] 
inches apart. The primary purpose of these 
units was to convey to the child that the 
machine was a formidable device. [...] 
each unit was a decal purporting to identify 
a complex instrument within [...], 
such as “ACO Narrow Band Monitor.” 
In order to generate the operation of the 
machine on each trial, the child was given 
the task of pushing down the 
levers that indicated her guess as to how 
many of four hidden dolls were looking in 
her direction. The nature of the blue re- 
sponse was vaguely defined for the child as 
having to do with the way in which the 
levers were pushed. However, the actual 
occurrence of transgression was controlled 
by E through a hidden silent mercury switch 
that monitored the activation of the units of 
the machine. The children tended to push 
first the lever on their far left. This lever 
was therefore made inactive, so that the blue 
response would not always occur at the be- 
ginning of a guessing sequence. The other 
three levers were equivalent in that any 
of them could activate the buzzer and the 
other units. When activated, the needle 
on the voltmeter would swing, the counter 
would click and turn over once, and two 
or three of the four lights would go on 
(according to which of the levers had been 
pushed). The child was told that there 
would always be at least two dolls looking 
toward her, in order to guarantee occur- 
rence of the blue response on appropriate 
trials. 

Approximately 6 feet behind the child, E 
sat at a second table, upon which was a [...] 
black cardboard screen. Behind the screen 
were the four identical small dolls and the 
mercury switch. After E and S were seated, 
E used one of four paradigms, each of which 
consisted of initial instructions and a se- 
quence of training trials designed to make it 
difficult for S to anticipate events on any 
given trial. Blue responses occurred on 
Trials 2, 5, 6, 8, and 9, and red responses on 
Trials 1, 3, 4, 7, and 10. Each trial began 
at roughly 8 to 10 seconds after the previous 
one ended. 
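
The fixed order of the 10 training trials can be written out explicitly. The sketch below, in Python, builds the sequence from the trial numbers given above (blue on Trials 2, 5, 6, 8, and 9; red on Trials 1, 3, 4, 7, and 10) and measures the longest run of each response type. The function names are assumptions made for illustration, and the run-length check is merely one informal way of looking at how predictable the sequence is.

```python
BLUE_TRIALS = (2, 5, 6, 8, 9)
RED_TRIALS = (1, 3, 4, 7, 10)


def training_schedule():
    """Return the response type for each of the 10 training trials, in order."""
    by_trial = {}
    for t in BLUE_TRIALS:
        by_trial[t] = "blue"
    for t in RED_TRIALS:
        by_trial[t] = "red"
    return [by_trial[t] for t in range(1, 11)]


def longest_run(seq, value):
    """Length of the longest unbroken run of `value` in `seq`."""
    best = run = 0
    for item in seq:
        run = run + 1 if item == value else 0
        best = max(best, run)
    return best


schedule = training_schedule()
```

Neither response type occurs more than twice in succession in this sequence.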

Labeling at Termination of Punishment 
(LTP). The E's initial instructions were 
as follows: 


Here's what we're going to do today. 
Back here, I have four dolls (E held up 
dolls, then replaced them behind screen). 
I'm going to place some of them looking at 
you from behind here and some looking 
away (E appeared to handle dolls behind 
screen). You're supposed to guess how 
many are looking at you. There are al- 
ways at least two looking at you, but 
sometimes there might be more. Some- 
times three of them might be looking at 
you, and sometimes all of them. Each 
time, you guess, and then I change them, 
and then you guess again. 


The E then walked over to S’s table and 
said: 


Now what I really want to see is how 
you work with a machine. The way you 
guess is: you tell the machine what you 
think is the answer. The machine is auto- 
matic. When I set the dolls, the machine 
is set, too, with the right answer. Here's 
the way the machine knows whether you've 
guessed how many dolls are looking at 
you: you push down as many levers as 
you think there are dolls. If you think 
there are two dolls looking at you, push 
two levers (E pushed down first two levers 
from the left. The mercury switch was 
open at this point, so that none of the 
levers were activated). If you think there 
are three dolls looking at you, push three 
levers (E pushed down third lever). And 
if you think there are four, push four (E 
pushed down last lever). Remember that 
you always push at least two, and leave 
them down. Don’t push them back up. 
The machine doesn’t show you whether 
you guessed right, but it tells my instru- 
ments, so I know. And when we're all 
done, I’ll tell you how many times you 
guessed right. Do you understand? 


After S’s indication of understanding, E 
reset all levers, returned to her own table, 
and, while appearing to handle the dolls, 
closed the mercury switch. Then E returned 
to S’s table and continued: 


Now the most important thing is the way 
you use the machine, the way you push 
down the levers. I’ve done this with a 
lot of girls, and I’ve found out that there 
are two different kinds of girls who use 
the machine in two different ways. I 
don't know what to call them, so I just 
call one kind the girls who do it the red 
way and the others the ones who do it 
the blue way. 

Over here is a pile of Tootsie Rolls for 
you (E removed Tootsie Rolls from box 
and placed them in front of S). When 
you use the machine the red way, it just 
tells my instruments over there what 
you've guessed (E pointed to her own 
table). You push the lever down with 
your finger from the top (E demonstrated, 
carefully pushing down the inactive lever). 
Now the machine has a buzzer that some- 
times goes on (here E ostensibly adjusted 
something at the back of the machine)—it 


them in this empty box here, because you 
did it the blue way (as E 
cause you did it the blue way.” Just as E 
spoke the word “blue,” she simultaneously 
reset all of the levers that had been pushed 
down, so that the label was contiguous with 
the termination of both punishment and 
buzzer. Then E returned to her own table 
and began the next trial. 

Labeling at Onset of Punishment (LOP). 
The procedure for this paradigm was identi- 
cal, in most respects, to that described for 
the LTP paradigm above, except that the 
label “blue” was used at onset of punish- 
ment during training trials and occupied a 
corresponding position in the demonstration 
that accompanied instructions. The modi- 
fication of instructions began at the point at 
which the Tootsie Rolls were introduced: 


Over here is a pile of Tootsie Rolls for 
you. When you use the machine the red 
way, it just tells my instruments over 
there what you've guessed. You push the 
lever down with your finger from the top 
(E demonstrated on inactive lever). When 
you use it the blue way, the machine has 
a buzzer that goes on—it sounds like this 
(E pushed down the second, active lever). 
You can hear the buzzer and you can see 
what’s happening. When that happens, 
you lose some of your Tootsie Rolls. I 
take them away and put them in this empty 
box here (then E reset the levers and ter- 
minated the buzzer, but without using the 
label “blue” at this point). 


Nontransgression training trials were 
treated in the same way as in the LTP 
paradigm. On transgression trials, however, 
E rose and said, immediately at the onset of 
buzzer and punishment: “You did it the blue 
way—No!” Then E walked over to S's 
table and added, while looking at the ma- 
chine: “You lose some of your Tootsie 
Rolls.” E removed two Tootsie Rolls from 
S's pile, put them into the box, and ter- 
minated buzzer and punishment by resetting 
all levers. The label “blue” was not used, 
of course, at this termination point. 

Nurturance (with labeling at onset of 
punishment). The procedure in this para- 
digm was basically the same as that used in 
the LOP paradigm. Here, however, E’s 
manner and voice conveyed considerable 
warmth and approval of S (except on trans- 
gression trials), in contrast to her generally 
neutral demeanor in all of the other para- 
digms. The modifications of procedure en- 
tailed intersticed expressions of verbal and 
physical affection during the instructions and 
the nontransgression trials. Thus, after 
the initial distinction between the two kinds 
of girls who used the machine the red way 
or the blue way, and just before introducing 
the Tootsie Rolls, E interjected the following 
comment (in a smiling, relaxed fashion): 


Most of the girls in my own special 
machine class (E had been originally in- 
troduced to all Ss as a “special teacher”) 
are girls who do it the red way. As a 
matter of fact, you look something like 
one of them. I think I'm going to like 
you as much as I like my own girls (and 
E gently patted S on the head or shoulder). 


On transgression trials, E's behavior was 
identical to that described above for the LOP 
paradigm. On nontransgression trials, E 
rose and said, with maximal warmth and 
enthusiasm: “Good!—you did it the red 
way.” The E then walked over to S's table, 
reset the levers, and smilingly patted S on 
the head or shoulder before returning to 
her own table. 

Control (labeling without punishment). 
The procedure for the control paradigm was 
similar in most respects to the procedure 
used in the LTP paradigms. There was, 
however, no punishment. The last brief sec- 
tion of the instructions was modified to 
inform the child of the possibility of punishment,
so that this paradigm would be equated to
others in having the general threat of punitive
sanctions. The informational role of the buzzer
was also emphasized somewhat more than it had
been in other paradigms, in order to eliminate
any punitive connotation in the association
between the buzzer and the label “blue.” After
the initial introduction to the task, the modified
section of the instructions continued as 
follows: 


Over here is a pile of Tootsie Rolls for 
you. I might take a couple of them away 
sometimes, depending on how you use the 
machine. The machine and I need to 
know whether it’s the red way or the blue 
way. So we have a buzzer in the machine. 
When you use the machine the red way, 
it just tells my instruments over there 
what you've guessed (E pushed down the 
inactive lever). When you use the ma- 
chine the blue way, we can tell that, too, 
because the buzzer goes on—it sounds like 
this (E pushed down second lever). You 
can hear the buzzer and you can see what's 
happening—because you did it the blue 
way (as E said the word blue, she reset 
the levers, terminating the buzzer). 


On trials in which the blue response oc- 
curred, E rose, walked over to S’s table and 
said: “Uh-huh, you did it the blue way.”
As E said the word “blue,” she terminated
the buzzer. The E’s reaction to red responses
was the same as in the LTP and
LOP paradigms. 

Test trials. Trials 11, 12, and 13 were
test trials uniformly employed immediately
after all of the socialization paradigms. The
S's lever choices on Trials 11 and 12 were
followed by the buzzer, in order to test S's 
use of the label “blue” in reference to her 
action. To test for S’s use of the label “red,” 
S's lever choices on Trial 13 were not fol- 
lowed by the buzzer. The S's verbal re- 
sponses on all test trials were recorded 
verbatim. 

The E remained seated behind S on Trial 
11, and appeared oblivious to the buzzer, 
having intentionally preoccupied herself with 
her recording sheets. If the child made no 
comment within 10 seconds after buzzer 
onset, E casually asked: “What happened 
this time?” This stimulus was used to elicit 
a verbal response from S. If S applied the 
label “blue” to her own behavior in any 
way, either spontaneously or in response to
the eliciting stimulus, E then terminated the
test trial in the same way that she terminated 
trials in the preceding training series (in- 
cluding the labeling and punishment, where 
the latter was appropriate). If S made a 
verbal response that did not include the use 
of the label “blue,” E asked: “What do you 
mean?” After this last question, E terminated
the test trial regardless of S's response.
The punishment used in training
was continued, in closing the test trial, so 
that the label “blue,” if used by the child, 
would not receive a direct external rein- 
forcement through punishment avoidance, 
since it was also to be tested on the subse- 
quent trial (12). 

The procedure for Trial 12 was the same 
as that for Trial 11, but was preceded by a 
statement intended to facilitate spontaneous
responses. Before Trial 12 began, E said: 
“My eyes are getting tired from looking at 
these instruments. After you've guessed this 
time, you can tell me what happened yourself.”
The same procedure was also used on
Trial 13, but the introductory statement used 
on Trial 12 was not repeated. After testing 
for S’s use of the label “red,” E concluded 
Trial 13 in the way appropriate to red re- 
sponses on the preceding training series. 

Closing Procedure. A closing procedure 
was used to assure S of the adequacy of her 
performance, to relieve any anxiety about the 
loss of Tootsie Rolls, and to invoke S’s cooperation
in not discussing the experiment
with other children. 


Results and Discussion 


Certain features of the children’s 
behavior during the training paradigms 
were strikingly apparent. They uniformly
exercised great care in pushing
the levers and showed obvious signs of 
discomfort upon the occurrence of 
transgression. Their pleased smiles in 
the nurturance paradigm also affirmed 
the experimenter's success in convey- 
ing affection. The power acquired by 
the buzzer was clearly visible on Test 
Trials 11 and 12 (except in the case 
of the control paradigm). The chil- 
dren were restless and tense, and most 
of them looked back expectantly at the 
now preoccupied experimenter. Although
some children in the LTP
paradigm made spontaneous labeling 
responses, the children generally ver- 
balized the labels only in response to 
the experimenter’s eliciting stimuli. 
This was true even on Trial 12, despite 
the initial statement that the children 
themselves could report on what hap- 
pened, and was very possibly attribut- 
able to the effective functioning of the 
labels at an implicit level. 

Numerous responses were given to 
the experimenter’s verbal stimuli. 
Many children simply offered repor- 
torial statements such as: “the buzzer 
went on,” “I pushed the levers,” “the 
lights went on,” “I don't know,” 
“you'll have to get up again,” “I lose 
some Tootsie Rolls,” etc. But only 
the relevant labeling responses, applied 
by the children to their behavior or 
to themselves, were used as evidence 
of self-evaluation. These generally 
took the form of: “I did it the blue
(red) way!” or, less commonly, “I was
blue (red)!” Table 1 shows, for each
of the test trials, the relative frequen- 
cies of the appropriate labeling re- 
sponses among the four paradigms. 
Chi-square values for comparisons of 
the frequencies are presented in Table 
2. The observations on Test Trial 12 
add only an indication of the stability 
of the responses seen on Trial 11. 
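
The continuity-corrected chi-square used for 2 × 2 comparisons of this kind can be sketched as follows. This is a minimal illustration in Python, not the original computation: the function name `yates_chi_square` and the example counts are hypothetical, since the cell frequencies of Table 1 are not reproduced here.

```python
def yates_chi_square(a, b, c, d):
    """Chi-square for the 2 x 2 table [[a, b], [c, d]] with continuity correction.

    Rows are two paradigms; columns are children who did / did not give
    the labeling response. All four arguments are non-negative counts.
    """
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column total must be positive")
    # Yates' correction: reduce |ad - bc| by n/2, but never below zero.
    adjusted = max(0.0, abs(a * d - b * c) - n / 2)
    return n * adjusted ** 2 / (row1 * row2 * col1 * col2)


# Hypothetical counts, for illustration only (NOT the values of Table 1):
# 10 of 12 children label in one paradigm, 3 of 12 in another.
example = yates_chi_square(10, 2, 3, 9)
print(round(example, 2))
```

With one degree of freedom, the resulting value would be referred to the chi-square distribution; for a one-tailed test, as in the note to Table 2, the tabled two-tailed probability is halved.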

TABLE 1

[Table body not recoverable from the source. Surviving fragments: the column heading “Control (N = 12)” and the note “Labeling at termination of punishment.”]


The paradigm that is singularly effective
in the induction of self-criticism
is the one in which the socializing
model uses the critical label at the termination
of both the punishment and
the externalized response-contingent
signal of transgression. This finding
strongly confirms the view that the
child’s reproduction of a component
of social punishment rests on its original
external position with respect to
the onset and attenuation of anxiety.
The fact that the blue label clearly attains
functional value only in the temporal
setting of the LTP paradigm,
despite its not having been overtly
verbalized during training, also points
to the acquisition of self-criticism as
an instrumental response that does not
require direct external reinforcement.

The infrequent use of the label among 
children exposed to a model who used 
it at onset of punishment and trans- 
gression signal, including those so- 
cialized in a nurturant context, makes 
it apparent that the reinforcing prop- 
erties of the label are not to be assigned 
to any generalized effect of punishment. 
There is also no support for the view 
that self-criticism represents a broader 
tendency to reproduce the behavior of 
a model who is nurturant but with- 
draws nurturance in order to deliver 
punishment. The results of the ex- 
periment indicate that self-criticism is 
acquired as a specific response through
aversive learning. Its high probability
of occurrence, when its stimulus properties
have been independently presented
as the terminal component of
punishment, cannot be understood without
reference to the mediation of anxiety
and anxiety reduction.


TABLE 2
CHI-SQUARE VALUES FOR COMPARISONS AMONG FOUR EXPERIMENTAL SOCIALIZATION
PARADIGMS OF THE FREQUENCY OF LABELING AND NONLABELING RESPONSES

Comparison                          Trial 13 (Red label)
LTP versus LOP                      0.12
LTP versus nurturance (LOP)         0.01
LTP versus control                  0.02
LOP versus nurturance (LOP)         0.01
LOP versus control                  0.16
Nurturance (LOP) versus control     0.04

Note. Chi-square values for 2 × 2 contingency tables (employing correction for continuity) based on frequencies in Table 1. The columns for Trials 11 and 12 are not recoverable from the source.
* p < .05 (one-tailed test).
** p < .01 (one-tailed test).

The control paradigm eliminates in- 
terpretation of these findings in terms 
of associative processes not contingent 
on punishment—such as recency or 
stimulus impact (salience) effects stem- 
ming from the conjunction of the label 
with the termination of the buzzer at 
the end of each trial. It cannot be 
said, for example, that all children 
would be generally predisposed to imi- 
tate the experimenter’s labels, for rea- 
sons unrelated to their anxiety-reduc- 
ing value, unless the labels had been 
made aversive stimuli through their 
association with onset of punishment. 
Few children in the control paradigm 
used the label “blue,” even though it 
occurred at the termination of buzzer 
and training trial. Embedding the la- 
bel in a punitive context is clearly a 
prerequisite of its adoption by the 
children.6

A comparison of the frequencies of 
occurrence of the blue and red labels, 
following the LTP paradigm, also indi- 
cates no general tendency of the chil- 
dren to imitate the experimenter's ver- 
balization, and again points to the spe- 
cific anxiety-reducing function of the 
blue label. Of course, the red label
would not have been expected to acquire
much value from its association
with relatively neutral events. Its restricted
use following nurturant socialization
might be regarded as more surprising,
however, since there it occurs
in the context of warm verbal approval
and physical affection. It may be that
the reinforcing effects of the kind of
nurturance given are not as discrete
and well defined as the effects of anxiety
reduction. But perhaps a more
plausible explanation is that the motivation
to produce an evaluative label
associated with reward is difficult to
establish in a learning situation so
largely defined by punishment. In any
case, the findings suggest that positive
self-evaluation requires its own distinct
reinforcing consequences.


6 The buzzer, though used as a cue, undoubtedly
had some inherent aversive properties,
the termination of which may have been
sufficient in itself, for a few children, to lend
anxiety-reducing value to the blue label in
the control paradigm. The occasional use
of the blue label by children exposed to the
paradigms where it appeared directly upon
transgression may be attributable to covert
situational cues or irregularities of timing
which permitted too much anticipatory anxiety
before its occurrence.


MAINTENANCE 


One of the remarkable characteris- 
tics of the internalized derivatives of 
social learning is their persistent main- 
tenance in the absence of external sur- 
veillance and reinforcement. Aversive 
responses, in particular, can often be 
quite resistant to extinction (Miller, 
1959; Solomon & Brush, 1956), and 
one would expect that self-criticism, a 
response that is even initially intrinsi- 
cally reinforced, might easily become 
independent of its original conditions 
of learning. As has already been 
pointed out, the punishment and criti- 
cism of parents sometimes follow fairly 
rapidly upon occurrence of a response 
to which internalized anxiety is to be 
attached. Yet once the child’s anxiety 
comes to precede its punishment, cer- 
tain cue components of the punishment 
become associated with anxiety reduc- 
tion. Correspondingly, in the experi- 
mental paradigm with labeling at ter- 
mination of punishment, onset of 
punishment was immediately coincident 
with onset of the transgression signal, 
but the (external) presentation of the 
critical label directly coincided with the
signal's termination. Such temporal
relationships among the cues to which 
anxiety is attached, the occurrence of 
punishment, and the stimulus proper- 
ties of aversive responses, appear to 
contribute as much to the maintenance 
of these responses as to their acquisi- 
tion (Brush et al., 1955; Church et al., 
1956). 

Part of the relative insensitivity that 
self-criticism can show to changes in 
external conditions may be a function 
of the resistance to extinction of the 
intense anxiety attached to some trans- 
gressions (Bitterman & Holtzman, 
1952; Solomon & Wynne, 1954). The 
actions around which parents hope to 
inculcate the most permanent and gen- 
eralized internalization are those which 
they are likely to punish most severely. 
The failure of anxiety to disappear, 
when apparently deprived of its origi- 
nal support in the form of explicit pun- 
ishment, may also follow from a kind 
of inconsistent reinforcement related to 
generalization of perceived punitive 
events. Children may perceive puni- 
tive implications even where punish- 
ment is not intended, because they can 
so easily assimilate both social and im- 
personal events to their experience of 
the wide spectrum of punitive capaci- 
ties previously represented in the be- 
havior of socializing agents. In the 
ecology of interaction between parents 
and children, inconsistency of per- 
ceived punishment would be inevitable. 
And even beyond childhood, the most 
infrequent application of real or per- 
ceived punishment would be sufficient 
to interfere with the extinction of in- 
ternalized anxiety. 

While internalized reactions to trans- 
gression do persist in the absence of 
predictable external punishment, their 
endurance when followed by punishment
perhaps even more effectively
shows that their reinforcement is not
tied to external consequences. Children
are often punished for their actions
despite the intervening implicit
or overt presence of self-criticism. The 
cue properties of their self-critical re- 
sponses would, under these circum- 
stances, precede external punishment. 
They would therefore no longer have 
quite the same signal value, with re- 
spect to onset and termination of anx- 
iety, which they initially had as a 
component of punishment. The anx- 
iety-reducing status of self-criticism 
could not be sustained under such con- 
ditions, if it actually remained depend- 
ent on the temporal relationship of the 
critical labels to the external events 
which originally marked anxiety reduc- 
tion. Moreover, the continued experi- 
ence of social punishment affords chil- 
dren many opportunities to observe 
that internalized anxiety can be ter- 
minated without their criticizing or 
punishing themselves, though not in a 
way that is equally under their control. 
It is possible, of course, that the ir- 
regular presence of punishment serves 
primarily to strengthen the anxiety 
that motivates self-criticism. A num- 
ber of studies with animals have indi- 
cated that punishment of aversive re- 
sponses heightens their resistance to 
extinction (Gwinn, 1949; Sidman, 
Herrnstein, & Conrad, 1957; Solomon 
& Wynne, 1954). But whatever the 
effects of external punishment upon the 
self-critical response, once it is estab- 
lished, it seems necessary to attribute 
its persistent maintenance to the anx- 
iety-reduction attached to its intrinsi- 
cally produced cues. 

In order to assess the durability of 
the tendency to reproduce a verbal 
component of punishment, three dis- 
tinct attempts were made to eliminate 
the use of the blue label among addi- 
tional groups of children, after they 
had acquired it through exposure to 
the socialization paradigm with labeling 
at termination of punishment. The
three procedures may be designated as
extinction paradigms, even though the 
original reinforcement (anxiety reduc- 
tion) for the self-critical response can- 
not really be withdrawn by changes in 
external conditions. One group was 
taken through a procedure intended to 
diminish the anxiety-reducing proper- 
ties of the blue label by changing its 
temporal relationship to both punish- 
ment and anxiety. The procedure used 
with the second group was directed to 
the question of whether the label would 
retain its value over a period during 
which punishment and anxiety reduc- 
tion occurred without its being ver- 
balized by either child or socializing 
agent. The third paradigm, which 
made the most direct attack upon the 
autonomy of self-criticism, removed 
punishment as well as verbalization of 
the critical label, in an attempt to ex- 
tinguish the anxiety attached to the 
transgression. 


Three Extinction Paradigms 


Subjects 


Thirty-seven additional subjects, drawn 
from the same population used in the original 
four socialization paradigms, were exposed 
to the LTP paradigm. Thirty of these new 
subjects met the previously employed acqui- 
sition criterion, and 10 were assigned to 
each of the three extinction series. 


Procedure 


In an effort to eliminate the self-critical 
response before it was given repeated overt 
verbalization, all of the extinction procedures 
were begun as soon as S verbalized the label
“blue” on either of the two acquisition test
trials (11 or 12), and all subsequent trials
were run as the extinction series. The procedures
were run so that each two transgression
(buzzer) trials were followed by
a nontransgression trial, in order to avoid 
uninterrupted punishment in the first two 
extinction paradigms. Thus, beginning with 
the first acquisition test trial, the sequence 
of transgression trials during extinction was: 
11, 12, 14, 15, 17, 18, etc.7


Extinction Paradigm 1. In this series, E
continued to use, on transgression trials, the
same verbal eliciting stimuli used on acquisition
test trials. As soon as S gave the
self-critical response on either Trial 11 or 12,
and regardless of S's response on subsequent
transgression trials, E rose and said, while
walking to S’s table: “Well, you lose some
of your Tootsie Rolls.”8 The E then removed
two Tootsie Rolls from S's supply
and terminated the buzzer, but no longer used
the label “blue.” On nontransgression trials,
E simply rose and reset the levers on the
machine (without using the label “red”).
The effect of this procedure was that the
label “blue,” when now “presented” by S
during transgression trials, was consistently
followed by punishment and temporary continuation
of the buzzer that signaled transgression.
Extinction trials were run until
there were four consecutive transgression
trials without verbalization of the blue label,
or until 10 transgression trials had occurred,
although additional trials were permitted if
S showed signs of extinguishing.

Extinction Paradigm 2. The blue label
was eliminated from punishment during this
procedure, and E also discontinued the verbal
stimuli used to elicit S’s overt expression of
the label. The S was consequently exposed
to termination of both punishment and buzzer
without verbalization of the label. Following
S’s self-critical response on Trial 11 or
12, E rose, administered the usual punishment
without using the blue label, and terminated
the buzzer. On all subsequent transgression
trials, E carried out the same procedure,
beginning immediately upon onset of
the buzzer. On nontransgression trials, E
reset the levers without comment. The extinction
series was run through Trial 25.
Trials 26 and 27 were extinction test trials,
and restored the verbal eliciting stimuli used
by E on acquisition test trials. If S verbalized
the blue label on neither of the two
test trials, the self-critical response was considered
to be extinguished.


7 Children to be taken through the two
extinction paradigms in which punishment
was continued were provided, before training
began, with 20 Tootsie Rolls (instead of the
15 provided in the original socialization paradigms),
so that they could continue to be
deprived of them during the entire sequence
of transgression trials throughout the acquisition
and extinction procedures.

8 The expletive, “No!,” used as a component
of punishment during training trials, was removed
in this first extinction paradigm. Its
continued use, once the child began to verbalize
the blue label, might have led to
elimination of the label for reasons unrelated
to the change in its temporal relationship to
anxiety and punishment.

Extinction Paradigm 3. In this paradigm, 
both punishment and verbalization of the 
label “blue” were entirely removed. When 
S gave the self-critical response on Trial 11 
or 12, E merely rose, walked over to S's 
table, and terminated the buzzer. On subse- 
quent extinction trials, both transgression 
and nontransgression, E continued to simply 
walk over and reset the levers without com- 
ment. The extinction criterion was S's 
failure to use the self-critical label on either 
of the two extinction test trials (26 and 27), 
when E’s eliciting stimuli were restored. 


Results 


The effective induction of self-criti- 
cism, in the socialization paradigm with 
labeling at termination of punishment, 
was again confirmed by the fact that 30 
of the 37 children taken through the 
paradigm met the acquisition criterion. 
Table 3 shows that, in each of the three 
extinction groups, only two of the ten 
children used met the extinction cri- 
terion. None of the children in whom 
the response failed to extinguish gave 
any indication of relinquishing their 
use of the critical label. Under the 
first extinction procedure, children who 
did not meet the extinction criterion 
used the label uniformly throughout 
the extinction trials. And in the sec- 
ond and third paradigms, children who 
retained the response invariably gave 
it on both of the extinction test trials. 
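
The proportions implied by these counts can be worked out directly. The following Python lines are a small arithmetic sketch; the variable names are illustrative, and the counts are those stated in the text and in Table 3.

```python
from fractions import Fraction

# Counts reported in the text and in Table 3.
taken_through_ltp = 37
met_acquisition = 30
table3 = {  # paradigm: (extinguished, did not extinguish)
    "Extinction 1": (2, 8),
    "Extinction 2": (2, 8),
    "Extinction 3": (2, 8),
}

acquisition_rate = Fraction(met_acquisition, taken_through_ltp)
retained = sum(kept for _, kept in table3.values())
assigned = sum(gone + kept for gone, kept in table3.values())
retention_rate = Fraction(retained, assigned)

print(f"acquisition: {met_acquisition}/{taken_through_ltp} = {float(acquisition_rate):.3f}")
print(f"retention:   {retained}/{assigned} = {float(retention_rate):.2f}")
```

On these figures, roughly 81% of the children acquired the self-critical response, and 80% of those assigned to an extinction series retained it.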

The three extinction paradigms do 
not go to the limit of effort that could 
be made to eliminate the self-critical 
response. Other methods which have
been used to facilitate extinction of
aversive responses, such as pairing the 
anxiety-inducing stimulus with positive 
reinforcement (Hall, 1955) or reward- 
ing an incompatible response (Solomon 
& Wynne, 1954), are not represented 
here. It seems apparent, however, that 
the experimentally socialized labeling 
response has some of the tenacity that 
we might expect of internalized self- 
evaluation. 


SOME FURTHER THEORETICAL IMPLICATIONS


The experimental literature on pun- 
ishment learning typically centers on 
the conditioning of anxiety (or “aver- 
siveness”) to the organism’s own re- 
sponse cues, and the consequences of 
punishment are indexed in the sup- 
pression of the punished behavior 
(Dinsmoor, 1954; Estes, 1944; Miller, 
1959; Mowrer & Viek, 1948). The 
inhibition of transgressions would be, 
of course, one of the derivatives of 
their definition through social punish- 
ment. But self-criticism provides a 
response for reducing the anxiety that 
follows the commission of a transgres- 
sion. The response does not require 
reinforcement by avoidance or termina- 
tion of external punishment, and it ac- 
tually reproduces stimulus events which 
have been previously part of that pun- 
ishment. We are confronted, then, 
with an anxiety-reducing response 
whose acquisition is contingent on the 
prior association of its stimulus properties
with the presence rather than
with the absence of punishment.


TABLE 3
FREQUENCY OF EXTINCTION AND NONEXTINCTION OF THE SELF-CRITICAL RESPONSE
UNDER EACH OF THREE EXTINCTION PARADIGMS

Response              Extinction 1   Extinction 2   Extinction 3
                      (N = 10)       (N = 10)       (N = 10)
Extinguished          2              2              2
Did not extinguish    8              8              8

It is not really paradoxical that self- 
criticism can be acquired, and often 
permanently maintained, despite the 
fact that it reproduces punishment. 
Aversive responses have frequently 
been shown to be at least as sensitive 
to anxiety reduction as they are to 
punishment avoidance, when these two 
outcomes are partially separated, and 
sometimes they seem relatively im- 
pervious to their punitive consequences 
(Gwinn, 1949; Sidman & Boren, 1957; 
Solomon & Wynne, 1954). It is pos- 
sible, in fact, to interpret certain studies 
of both human and animal subjects 
(D’Amato & Gumenik, 1960; Farber, 
1948; Maier, 1949) as indicating that, 
under high anticipatory anxiety and 
with no easily available avoidance re- 
sponse, subjects will move in the direc- 
tion of external punishment because of 
its anxiety-attenuating significance. In 
a similar way, the anxiety that moti- 
vates self-criticism is initially dependent 
on the motivating properties of punish- 
ment, while the reinforcement of the 
response is dependent on its replica- 
tion of the anxiety-reducing signal 
properties of punishment. Self-puni- 
tive responses are, therefore, clearly 
not to be regarded as having been 
originally learned as avoidance re- 
sponses. Their self-reinforcing char- 
acter is acquired on the basis of 
anxiety-reduction as the “primary rein- 
forcement.” 

It is interesting to note that Freud 
(1936, Ch. 8) saw anxiety reduction as 
the function of children’s adoption of 
the punitive or threatening behavior of 
a model, and that its role has also been 
previously suggested in concepts such 
as “identification with the aggressor” 
(A. Freud, 1946, Ch. 9) and “defensive 
identification” (Mowrer, 1950, Ch. 21).
Certainly it is not difficult to perceive
that the mechanism of reinforcement in
the learning of self-criticism may serve
as the general mechanism for what may 
be called identification through aversive 
learning—that is, for the acquisition of 
any response in which, without the 
benefit of external reinforcement, the 
individual directs toward himself the 
socially punitive behavior that he has 
previously experienced from without.9

Further research on the anxiety- 
reducing properties of social punish- 
ment, and on their imitative reproduc- 
tion, might well give attention to the 
intensity of punishment. There is no 
reason to assume that the punishment 
to be reproduced must be less “painful” 
than the antecedent anxiety. Never- 
theless, when punishment is very se- 
vere, whatever decrement of antic- 
ipatory anxiety attaches to its cue 
properties could be overshadowed by 
the increment of aversive stimulation 
from its drive properties. Experimen- 
tal socialization paradigms might be 
used to compare the conditions required 
for the learning of self-criticism with 
those required for the self-directed 
replication of nonverbal components of 
punishment, some of which would be 
more inherently unpleasant than a ver- 
bal label. Such paradigms might also 
be devised so as to suggest the theo- 
retical extensions necessary to account 
for the child’s learning to direct its 
reproduction of punishment toward 
others. They could further be used 
to examine other relationships of timing
and magnitude among components
of punishment. Presumably, any component
of punishment could assume
anxiety-reducing value, provided that
it follows some minimal internalized
anxiety. But the breadth of the phenomenon
may be limited by the duration
and relative severity of the remaining
punishment that follows a given
component. Verbal social responses
which already have become mildly
punitive to the child are often used by
socializing agents as cues to signal
more severe punishment if a transgression
is completed or continued. It
would be interesting to know whether
such cues attain some anxiety-reducing
value, despite their position as warning
signals, because of their mediation
of inhibitory responses which avoid the
punishment. Finally, perhaps the most
intriguing experimental translations of
naturalistic socialization would be in
the area of inconsistent punishment.
Studies of the discriminative events
within which punishment is inconsistent,
and of the extent to which socializing
agents provide consequences of
transgression other than direct punishment,
might reveal that inhibition, self-punishment,
and other internalized reactions
to transgression have common
elements other than the anxiety that
motivates them and the anxiety reduction
that serves as their reinforcement.


9 Children also often react to the transgressions
of others by reproducing the same
punitive responses to which they have been
previously subjected. A tentative beginning
toward accounting for this phenomenon
might be offered by noting that stimulus
generalization may operate to cause the
transgressions of others to arouse the child’s
anxiety, and to elicit its reproduction of
punishment, if they are similar to those for
which the child itself has been previously
punished. One must also account, however,
for the change in the target of the punishment.


Nurturance and the Internalized Con- 
sequences of Punishment 


A comparison of the effects of the 
various socialization paradigms de- 
scribed earlier indicates that the inter- 
nalization of punishment does not rest 
on a broader predisposition to repro- 
duce the behavior of an otherwise nur- 
turant or affectionate model. Nor does 
it appear, as some theorists have sug- 
gested (Kagan, 1958; Maccoby, 1959; 
Whiting, 1960), that self-criticism represents
a generalized tendency to exercise
control over the threatening punitive
resources at the disposal of a
socializing agent. While the duration 
and intensity of nurturant treatment 
are more extended in a parent-child 
relationship, the warm approval and 
physical affection given to the children 
in the nurturance paradigm would cer- 
tainly have been expected to attach 
some positive value to the model’s role 
and to cause punishment to be experi- 
enced as a withdrawal of affection. 
And, in view of the effectiveness of the 
paradigm with labeling at termination 
of punishment, the failure of the nur- 
turance paradigm to induce many self- 
critical responses cannot be attributed 
simply to an insufficiency of nurturance.
The reproduction of social punishment
appears to be acquired only
when the relevant components of pun- 
ishment have a circumscribed temporal 
relationship to an anticipatory aversive 
state. 

The contributions of punishment and 
anxiety to the learning of self-criticism 
should not be extrapolated, of course, 
to other kinds of imitative social learn- 
ing, in which children do appear to 
provide their own reinforcement by 
reproducing those stimulus aspects of 
a model which have been directly asso- 
ciated with nurturance or reward. 
Persuasive theoretical cases have been 
made for the place of a reinforcement 
mechanism derived from reward in 
much of imitative behavior (Mowrer, 
1950, Ch. 21; Mowrer, 1960b, Ch. 3; 
Sears et al., 1957, Ch. 10; Whiting & 
Child, 1953, Ch. 11); and there are 
empirical findings (Bandura & Hus- 
ton, 1961; Mussen & Distler, 1959; 
Sears, 1953) which support the com- 
mon observation that the warmth and 
affection of socializing agents foster the 
child’s tendency to adopt a great variety 
of their responses. We can hardly be
surprised, however, to find that the
reproduction of social punishment cannot
be subsumed under the consequences of
a model’s rewarding characteristics. 
Children sometimes reproduce behav-
ioral pieces of the most entirely fright-
ening figures which they encounter in 
both reality and fantasy. Similar phe- 
nomena in adults are described in 
Bettelheim’s (1943) well-known re- 
port of life in a concentration camp, 
and are also seen by the sensitive ob- 
server under ordinary social circum- 
stances. Conversely, there is no reason 
to expect the child’s motivation to re- 
produce the behavior of a nurturant 
model to be so powerfully indiscrimi- 
nate, even under the duress of with- 
drawal of affection, as to encompass 
the model’s punishment. An account 
of the diversity of modeling responses 
usually taken as evidence of identifica- 
tion requires, then, an extension of 
learning processes derived from pun- 
ishment as well as of those derived 
from reward.10

The internalized consequences of so- 
cial punishment would, nevertheless, 
not be totally indifferent to the nur- 
turance and affection of socializing 
agents. Numerous descriptive surveys 
of internalized responses to transgres- 
sion, whose rough indices of internali- 
zation have given much attention to 
self-criticism, indicate the absence of 
any consistently direct relationship be- 
tween extent of internalization in the 
child and the amount of nurturance 
given by parents (Bronfenbrenner, 
1961; Heinecke, 1953; Sears et al., 
1957, Ch. 10; Whiting & Child, 1953, 
Ch. 11). There is evidence, however, 


10 Even if the effects of the parents’ nur-
turance were so powerful as to generalize
and attach some value to their punitive pres-
ence, a gradient of generalization would yield
the hardly credible prediction that, if the
child transgressed and consequently felt de-
prived of their affection, it would then be
more likely to apply their rewarding actions
to itself than it would any component of
their punishment.


that the usual effects of punishment 
upon internalization require a mini- 
mally nurturant context. The children 
of extreme parental groups, who are 
highly rejecting or punitive toward 
their offspring, appear to show poor 
internalization in their social behavior 
(Bandura & Walters, 1959; Nye, 
1958; Sears et al., 1957, Ch. 10; 
Whiting & Child, 1953, Ch. 11). Loss 
of affection is probably a major ele- 
ment in the effectiveness of any form of 
social punishment administered by par- 
ents to their children. This element 
could be negligible, if parents have 
been so generally cold or punitive as 
to adapt the child to a high level of 
aversive stimulation in their presence. 
Their specific punishments might then 
not produce an increment of anxiety 
sufficient to independently attach itself 
to the child’s own response cues, so 
that it could motivate self-criticism or 
other reactions to transgression even 
in the absence of external control. It 
seems even more probable that parents 
who are generally lacking in nurturance 
use unusually severe punishments. The 
intensity of ordinary social punishment 
permits its components to have some 
significance for attenuating anxiety and 
is not likely to disrupt the child’s atten- 
tion to this significance. Severe pun- 
ishment, in contrast, may evoke com- 
peting responses, such as withdrawal 
or escape, which are more closely asso- 
ciated with anxiety reduction than is 
any aspect of the punishment itself. 
The key role that has been assigned 
to self-criticism in many surveys of 
internalization also affords an oppor- 
tunity to inferentially confirm the im- 
portance of the temporal relationships 
so vividly illustrated in the experimen- 
tal paradigms. A number of investi- 
gators (Allinsmith, 1960; Aronfreed, 
1961; MacKinnon, 1938; Sears et al., 
1957, Ch. 10; Whiting & Child, 1953, 
Ch. 11) have found a rather limited 
positive association between the extent
of children’s internalization of re-
sponses to transgression and their par-
ents’ use of methods of punishment
variously referred to as “love oriented,” 
“psychological,” or “induction” tech- 
niques. Withdrawal of affection has 
always been included among such tech- 
niques, and it often has been presumed 
to be their dominant common element. 
But the experimental findings reported 
here do not support the view that an 
explicit withdrawal of affection by a 
nurturant model induces a state of 
deprivation which is singularly effective 
in motivating a child’s reproduction of 
the model’s critical evaluative re- 
sponses. The apparent discrepancy 
may be resolved by observing that, 
while loss of affection would enter into 
the child’s experience of any form of 
punishment employed by its parents, 
their use of withdrawal of affection as 
the focus of their punishment might 
well incorporate certain features of ad- 
ministration and timing which are 
much less characteristic of other dis- 
ciplinary techniques. These features 
are precisely those which have been 
artificially removed in the experimen- 
tal nurturance paradigm, where the 
socializing agent's critical labels ac- 
quire little or no anxiety-reducing 
value because they are presented imme- 
diately upon the child’s awareness of 
transgression, before substantial anx- 
iety can be evoked. 

Parents who focus on withdrawal of 
affection in their discipline may be 
those who are also most oriented to- 
ward inducing an internal governor of 
conduct in their children, in contrast 
to other parents who are more oriented 
toward sensitizing their children to the 
punitive external consequences of trans- 
gressions with which they are visibly 
confronted (Aronfreed, 1961; Bron- 
fenbrenner, 1958). Accordingly, par- 
ents who emphasize withdrawal of af-
fection may be more disposed to punish
transgressions well beyond the point of 
their occurrence. And they may tend 
to maintain the cognitive and affective 
salience of their children’s transgres- 
sions by reinstating affection only after 
some time has elapsed. It is the man- 
ner in which they use withdrawal of 
affection, then, that would induce in 
their children internalized anxiety of
greater and more uncertain duration 
than that induced by other more direct 
and quickly terminated punishments. 
This more variable and prolonged pe- 
riod of anxiety would sustain a greater 
probability that the child would repro- 
duce components of parental criticism, 
or otherwise make its own anxiety- 
reducing responses, rather than wait 
upon external events. The fact that 
deprivation of affection does not re-
quire the physical presence of the par-
ents (and is often effected by their 
absence) is also useful in understanding 
why it makes the child prone to produce 
anxiety-reducing stimuli through its 
own resources. 


Cognitive Processes in Self-evaluation 


The critical label “blue,” used in the 
experimental socialization paradigms, 
was primarily a color designation in 
the children’s previous experience. It 
was applied to a transgression that was 
not really within their volitional control 
and not clearly related to evaluative 


11 The survey findings are often ambiguous 
because highly verbal parental behaviors, 
such as reasoning, explanation, and appeals 
to the child's already established standards, 
are commonly grouped together with actual 
withdrawal of affection within the class of 
disciplinary techniques which seem to have 
some association with greater evidence of 
internalization in the child. The findings 
could therefore indicate that it is the verbal 
and cognitive context of punishment, rather 
than the specific properties of deprivation of 
affection as a method of punishment, that 
fosters the internalization of self-criticism. 
standards with which they were al-
ready familiar. The presence and ab-
sence of the label during training were 
not even respectively associated with 
responses which could be intrinsically 
identified by the children. Yet, under 
the appropriate temporal conditions, 
they adopted the blue label to refer to 
their own actions—a behavioral change 
that appears to demonstrate that the 
motivational prerequisites for the learn- 
ing of self-criticism may be present in 
the most limited cognitive surround- 
ings. The infrequent application of 
the red label to nonpunished actions, 
even among children who did use the 
blue label and who would surely have 
some cognition of the blue-red distinc- 
tion, suggests further that these moti- 
vational precursors are essential to the 
acquisition of self-evaluative responses. 

That the cognitive context of a label- 
ing response is insufficient to fully de- 
fine its evaluative properties, without 
taking into account the affective condi- 
tions under which it is acquired, is also 
made clear by more general considera- 
tions. The discrepancies which indi- 
viduals may perceive between their 
behavior and their standards, particu- 
larly in areas of performance or 
achievement, are subjected to evalua- 
tive cognition and sometimes result in 
subsequently rewarded modifications of 
behavior. But their evaluative proc-
esses would not require that they re-
produce, and apply to their own be-
havior, a verbal component of the
punishment to which they have been
previously exposed. Nor do the re-
warding consequences of their behav-
ioral corrections reveal the motivation
or the reinforcement for their self-
critical responses. When self-criticism
does occur, its reinforcement cannot
be understood without reference to
its aversive motivational antecedents,
which are in turn dependent on the
affective correlates of evaluative proc-
esses. We seem to be in the habit of
regarding self-evaluation as being the 
instrumental verbal responses which 
are motivated by these affective cor- 
relates. But evaluative processes might 
better be conceived of as cognitive and 
affective phenomena which do not in 
themselves require verbal praise or 
censure. 

The treatment of self-criticism as an 
instrumental labeling response does not 
circumvent the importance of under-
standing the complex cognitive proc-
esses upon which self-evaluation is 
often based. We might consider, for 
example, whether reproducing the ver- 
bal components of social punishment is 
to be regarded as a moral response— 
or, to put it another way, whether it 
is to be taken as evidence of guilt. In 
the normative framework of our so- 
ciety, the term moral is commonly re- 
stricted to actions which follow from 
some degree of judgment exercised by 
the actor. The evaluative dimensions 
of such judgment are presumed to be 
ethical in nature, in the sense that they 
make articulate distinctions between 
right and wrong or good and bad. But 
many of the internalized derivatives of 
social reinforcement, including the ap- 
plication of either punitive or reward- 
ing labels to one’s own actions, may 
require only very rudimentary and ten- 
uous cognitive support. Many other 
internalized responses reflect socially 
derived cognitive processes which are 
highly developed along evaluative di- 
mensions which are not necessarily 
moral. This would surely be the case 
with the self-evaluation that attaches to 
success or failure in certain kinds of 
performance or control. 

There are formidable problems asso- 
ciated with any attempt to define moral 
cognition in a way that escapes cultural 
relativism. It might be possible, how- 
ever, to assign invariant properties to 
moral evaluation by reference to par-
ticular kinds of cognitive processes.
Certainly the moral properties of an 
evaluative response would be deter- 
for a given society and operationalized
in verbal behavior, guilt must be viewed 
as an essentially phenomenological con- 
struct of variable substance. 


12 An excellent discussion of the place of 
INTRATRIAL AND INTERTRIAL RETENTION:
NOTES TOWARDS A THEORY OF FREE
RECALL VERBAL LEARNING 1


ENDEL TULVING 


University of Toronto 


Trial-to-trial analysis of recall of individual items in a multitrial free 
recall learning experiment shows that the traditional learning curve 
can be expressed as an additive function of intratrial and intertrial 
retention. Intertrial retention, measured in terms of number of items 
retained from one trial to the next, increases as a logarithmic func- 
tion of trials, while intratrial retention, measured in terms of number 
of items retained from the input phase of a given trial to the output 
phase of that trial, remains practically invariant over trials. Inter- 
trial retention is positively correlated with subjective organization, 
giving support to the hypothesis that the increase in intertrial reten- 
tion as a function of practice reflects the growth in the size, but not 
necessarily the number, of subjective units of material that the subject
can retrieve from the memory storage.


One of the most universally known 
and reliable phenomena of memory lies 
in the improvement in recall of verbal 
materials under conditions of practice. 
Experiments in verbal learning, as in 
many other areas of psychological in- 
quiry, often yield inconsistent and con- 
tradictory data, but no experimenter 
has ever reported that his human sub- 
jects, appropriately instructed, failed 
to increase their recall over trials of 
an experiment. 

In spite of, or perhaps because of, 
the wide generality of the phenomenon, 
empirical and conceptual analyses of 
it have seldom been reported in the 
literature. A great deal of experi- 
mental evidence and attendant theo- 
retical speculation is available about 
conditions and variables that influence 


1 This research has been supported by the 
National Research Council of Canada, under 
Grant No. APA-39, and by the National 
Science Foundation, under Grant No. GB- 
810. A preliminary version of the paper 
was read as an invited paper at the annual 
meeting of the Canadian Psychological As- 
sociation in Montreal, June, 1961. The 
writer is indebted to George Mandler for 
constructive comments and cogent criticisms. 


the rate at which recall changes over 
trials, that is, about interaction effects 
involving trials as one of the inde- 
pendent variables, but systematic at- 
tempts to account for the effects of 
trials per se have not been popular. 

The present paper reports an analy- 
sis of the trial-by-trial memorization 
of a list of words under the method 
of free recall (FR). In a typical FR 
experiment with which we are con- 
cerned the subject is exposed to a list 
of words in the “input” phase of a 
trial and is then asked to recall as 
many of these words as he can, in 
any order he wishes. Then the same 
words are presented again, usually in 
a different order, and again at the end 
of the presentation, in the “output” 
phase of the trial, the subject recalls 
the words he remembers. This proce- 
dure, alternating input and output 
phases, can be repeated for any num- 
ber of cycles or trials. 

The subject’s task in the FR experi- 
ment appears to be relatively simple. 
Response learning in the sense of re- 
sponse integration (Mandler, 1954) is 
minimized whenever more or less 
familiar words are used, and associa-
tive learning is not necessary since 
the experimenter gives credit to the 
subject for words recalled regardless 
of their order. Memorization of a list 
of words in the FR experiment is 
rather reminiscent of that aspect of 
response learning that has to do with 
strengthening of responses specified by 
the input list so that they have “greater 
response strength than the many other 
responses in the repertoire which are 
not in the list the subject is to learn 
[Underwood & Schulz, 1960, p. 93].” 
If this reasoning is correct, then FR 
learning must involve only some of 
the processes underlying acquisition in 
serial and paired-associate learning
experiments. It thus constitutes a 
simpler task for the subject to master 
and a simpler situation for the experi- 
menter to analyze. 

Probably because of its apparent 
simplicity, free-recall learning has 
failed to arouse much curiosity among 
experimenters. Only a handful of 
studies having to do with multitrial 
FR learning have been reported in the 
literature (Bruner, Miller, & Zimmer- 
man, 1955; Horowitz, 1961; Murdock, 
1960; Waugh, 1961). But it has ap- 
pealed to several theorists who have 
constructed various stochastic models 
to describe its data (Bush & Mosteller, 
1955; Miller & McGill, 1952; Waugh 
& Smith, 1962). All of these models 
have succeeded in providing good or 
even excellent mathematical descrip- 
tions of certain quantitative aspects of 
empirical findings. Yet they have not 
been primarily concerned with isola- 
tion and identification of processes in- 
volved in memorization of verbal ma- 
terial. The Waugh and Smith (1962) 
model perhaps goes farthest of the 
three in that it refers to three hypo- 
thetical processes—“labelling,” “select- 
ing,” and “fixing”—identified with the 
three parameters of the model, but 
these processes are not related to any
“classical psychological functions [p.
142].”

Another characteristic that the sto- 
chastic models have in common lies 
in their assumption that individual list 
items are memorized independently of 
one another. The recall of a given list 
item is supposed to have no effect on 
the recall or nonrecall of any other 
item. The fact that such an assump- 
tion is not tenable is well known to 
the authors of the statistical models, 
although the implications of the cor- 
rect assumption of item relatedness 
have not yet been fully explored. In 
considering the problem briefly, Miller 
and McGill (1952) suggested “as- 
sociative clustering should affect the 
variability, not the rate, of memoriza- 
tion [p. 390],” but this somewhat 
startling hypothesis has not been sub- 
jected to an empirical test. 

Finally, the description of FR learn- 
ing provided by the stochastic models 
is difficult to reconcile with the funda- 
mental fact that an individual list item, 
if it has been previously integrated, can 
be “learned” by the subject on one 
trial of a very short duration, in the 
sense that the subject can always recall 
it immediately after seeing or hearing 
it once. What is it that permits the 
single item to be labeled, selected, and 
fixed with a probability of unity, to 
use Waugh and Smith’s language, if 
it is presented alone, but probabilities 
much smaller than unity when the item 
appears in a list? This particular 
criticism, of course, can be directed at 
almost any theoretical account of memo-
rization, both formal and informal, that
has ever been proposed, but the com- 
monality of an error does not justify it. 

Our present analysis of FR learning 
takes as its starting point the obser- 
vation that a small unit of verbal ma- 
terial, such as an individual list word, 
is always “learned” at the time of its 
presentation to the subject. It then
proceeds to examine to what extent 
these individual items are retained or 
forgotten, both within a single trial 
and over successive trials. The tradi- 
tional learning curve will be described 
as a composite of intratrial and inter- 
trial retention curves. Finally, inter- 
trial retention as the main component 
of the subject’s total recall performance 
will be shown to be related to the ex- 
tent to which subjects organize their 
recall. It will be argued that the trial- 
by-trial improvement in recall is a 
consequence of the development of 
higher-order units of material which 
mediate the retrieval of the informa- 
tion from the memory storage. 


ONE-TRIAL LEARNING AND INTRA-
TRIAL FORGETTING OF INDI-
VIDUAL ITEMS


Let us begin by considering what 
happens on the first trial in a typical 
FR experiment. Each subject will be 
able to recall some but not all words 
in the output phase of the first trial. 
Such limitation of first-trial recall has 
traditionally been ascribed to some 
limitation in the learning mechanism, 
that is, it has been interpreted as re- 
flecting the fact that the subject, for 
some reason or other, cannot learn all 
words on the first trial. For instance, 
in the well-known debate on whether 
associations in paired-associate learn- 
ing tasks are formed in an all-or-none 
or incremental fashion, the partici- 
pants, although disagreeing on the 
main issue, seem to agree on interpret- 
ing nonrecall of some response items 
as evidence for absence of, or incom- 
plete formation of associations in the 
input phase preceding the recall test 
(Estes, 1960, 1961; Postman, 1963; 
Rock, 1957; Underwood & Keppel, 
1962). 

Such an interpretation, if taken lit- 
erally, may prove somewhat misleading 
in pursuing the problems involved in
verbal learning. Recent evidence shows 
clearly that a small unit of well-inte- 
grated verbal material, or an associa- 
tion between two such units, can prac- 
tically always be recalled immediately 
following its presentation (Brown, 
1958; Murdock, 1961a, 1961b, 1963;
Peterson, Peterson, & Miller, 1961; 
Peterson, Saltzman, Hillner, & Land, 
1962; Tulving & Arbuckle, 1963). 
When we use probability of response 
as a measure of learning, therefore, 
we must conclude that learning of a 
small unit of material is always com- 
plete on a single trial. Learning, in 
this sense, is neither incremental nor 
all-or-none, it is always “all.”
Whether the small unit is presented 
alone or in a series of other units of 
the same class is immaterial as far as 
its one-trial acquisition is concerned.
If we presented a list of words to the 
subject and warned him in advance 
that we might stop the presentation at 
any time and test him on the word 
seen last, we would have every reason 
to believe that his recall would be prac- 
tically perfect. 

2 Rock, in considering the problem of 
“why, in multiple-item learning situations, 
it is not possible as a rule to learn more 
than a few associations on any one trial,”
mentioned the possibility that an association 
is always formed and “that the failure to 
get all items right on a test following the 
entire series is a matter of forgetting of 
already formed associations [Rock, 1957, p. 
192].” He rejected this interpretation for 
reasons that are not entirely clear to this 
writer. 

3 It is possible, of course, that subjects in 
certain types of experiments may in effect 
“ignore” some input items at the time of 
their presentation and concentrate their at- 
tention on a few specific items. One such 
selective strategy has been proposed by Fei- 
genbaum and Simon (1962) in their account 
of the serial position curve. These strate- 
gies might indeed explain why the immediate 
recall of single items or associations in some 
experiments has been somewhat less than 
perfect, but their existence cannot change 
This “learning” of an individual
item — whether we call it learning, per- 
ception, fixation, registration, or what 
not, does not matter, as long as we 
know the operational referent of the 
term—depends solely upon variables 
operating prior to and at the time of 
the presentation of the item. It must 
be distinguished from the retention of 
the item which depends on the condi- 
tions prevailing at the time of the 
learning as well as on certain variables 
operating in the retention interval, 
however short. Thus the fact that an 
individual item is always “learned” 
following a single presentation does 
not mean that it might not be forgotten 
later. Small amounts of material, well 
within the classical memory span of 
the individual, often become unavail- 
able for recall in a matter of a few 
seconds (Brown, 1958; Murdock, 
1961a, 1961b, 1963; Peterson & Peter- 
son, 1959; Tulving & Arbuckle, 1963). 
Nor does the conception of one-trial 
learning in any way preclude the pos- 
sibility that additional presentations or 
rehearsal of the once learned and re- 
called item may change its resistance 
to forgetting. Just as overlearning of 
larger units of material, such as lists 
of nonsense syllables, has been long 
known to improve the retention of 
material (Ebbinghaus, 1885), smaller 
units have also been shown to benefit 
from repeated presentation (Peterson 
& Peterson, 1959; Peterson, Saltzman, 
Hillner, & Land, 1962; Waugh, 
1962b). 

It seems reasonable to argue, in view 
of the foregoing considerations, that 
limited recall on the first trial does not 
necessarily reflect incomplete learning, 
but rather, incomplete retention. Al- 
though the intratrial retention inter- 
vals, between the presentation and the 
attempted recall of each individual
item, are short, many learned items 
can be forgotten within that interval. 
Whether such intratrial forgetting is 
interpreted in terms of the decay of 
memory traces (Broadbent, 1958; 
Brown, 1958; Conrad & Hille, 1958), 
in terms of input and output interfer- 
ence (Tulving & Arbuckle, 1963), or 
in terms of yet other factors, is not 
immediately relevant to our present 
purposes. What is important is the 
empirical fact that some individual 
items, although always recallable im- 
mediately following their presentation, 
do become unavailable for recall well
within the interval occupied by a single
trial. The recall scores on the first
trial, therefore, reflect the combined
effects of one-trial learning and intra-
trial forgetting, or simply of intratrial
retention.

We may, if we wish, regard the 
subject’s recall score on the first trial 
as a measure of his “immediate mem- 
ory,” provided that we realize that 
such “immediate” memory is limited 
for the simple reason that it is not 
immediate. In the FR experiment, it 
is only the last item that the subject 
could recall immediately after its pres- 
entation, but in a learning task in 
which the subject has to recall the 
items in the order of their presentation, 
recall is not immediate for any single 
item. 

The notions of one-trial learning and 
intratrial forgetting suggest that it is 
not the fact of “storage” of list items 
that is at issue in the FR experiment, 
but rather the form of storage, or ac- 
cessibility, of items (cf. Miller, Ga- 
lanter, & Pribram, 1960). The list 
items have been “stored” in the sub- 
ject’s memory a long time before he 
appears for the experiment and the 
input list serves simply as a set of 
instructions as to which of the stored 
items the subject has to retrieve. In 
this sense, then, what the subject has
to do in the FR experiment is to in- 
crease the accessibility of those items 
that are specified by the input list so 
that they can be readily retrieved from 
the storage. 

Be that as it may, there are certain
advantages in thinking of the first- 
trial recall as a measure of retention 
rather than learning. First, it might 
be easier to deal with questions such 
as why some items are retained and 
others forgotten within a single trial 
(e.g., Tulving & Arbuckle, 1963) than 
questions such as why some items are 
and others are not learned, or why 
each item is learned only to a certain 
extent and not completely. In terms 
of our existing conceptual tools as well 
as available empirical evidence we are 
probably better equipped to handle 
problems of retention and forgetting 
than of original acquisition. Second, 
and more important for the task at 
hand, the concept of intratrial reten- 
tion leads naturally to the concept of 
intertrial retention and thus suggests 
a potentially useful way of looking at 
trial-by-trial recall data. The relation 
of intratrial and intertrial retention to 
the overall recall performance will be- 
come clear if we consider the pattern 
of recall of individual list items from 
trial to trial. This will be done in 
the next section. 


TRIAL-TO-TRIAL ANALYSIS OF RECALL 


We have argued that under appro- 
priate conditions each list item is al- 
ways learned on the first trial and that 
the subject’s performance in the output 
phase of that trial is to be regarded as 
a measure of intratrial retention. The 
subject’s performance on the second 
trial, and on all succeeding trials, can 
similarly be regarded more profitably
as a measure of retention. On these 
trials, however, there are two sources 
of retained items. Some items could 
be recalled in the output phase of the
second trial, for instance, even without 
the benefit of their re-exposure in the 
input phase of the second trial: the 
subject remembers some of the re- 
sponses he made in the output phase 
of the first trial. The number of these 
responses provides a measure of inter- 
trial retention. In addition to these 
responses, the subject, in the output 
phase of the second trial, also remem- 
bers some of the responses he made in 
the input phase of the second trial. 
Their number provides a measure of 
intratrial retention. The subject's 
overall recall performance on all trials 
except the first, therefore, consists of 
two components—intertrial and intra- 
trial retention. 
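
As a rough illustration of this two-component view, the sketch below splits the recall on each trial after the first into items also recalled on the preceding trial and items recalled only after re-exposure. It is a sketch under stated assumptions, not a procedure taken from the text: the function name, the toy recall protocols, and the choice to count "items recalled now but not on the preceding trial" as the intratrial component are all hypothetical.

```python
def retention_components(recalls):
    """Split recall on each trial after the first into two counts.

    recalls: list of sets, one per trial, holding the items recalled in
    the output phase of that trial (hypothetical toy data).
    Returns one (intertrial, intratrial) tuple for each trial 2..k.
    """
    components = []
    for previous, current in zip(recalls, recalls[1:]):
        # Items recalled on this trial that were also recalled on the preceding one.
        intertrial = len(current & previous)
        # Items recalled on this trial but not on the preceding one (assumed
        # here to stand in for the intratrial component).
        intratrial = len(current - previous)
        components.append((intertrial, intratrial))
    return components


# Hypothetical recall protocols for three trials.
recalls = [
    {"dog", "lamp", "river"},
    {"dog", "river", "cloud", "stone"},
    {"dog", "river", "cloud", "lamp", "table"},
]
components = retention_components(recalls)
# The two components add up to total recall on each trial after the first.
totals = [inter + intra for inter, intra in components]
```

By construction the two counts sum to the total number of items recalled on the trial, which is the additive relation the paragraph describes.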

The conception of recall perform- 
ance in terms of the process of reten- 
tion or, looking at the other side of the 
coin, of forgetting, is not a novel one. 
McGeoch, for instance, more than 20 
years ago, pointed out the relations 
involved:

The changes in behavior (verbal re- 
sponses) acquired during the first trial are 
retained, at least in part, until the second 
trial. There new ones are added to those 
retained; some or all of the results of the 
first and second trial are retained until the 
third trial, when more are added, and so on, 
until practice stops. Not all of the ac- 
quisitions at each successive trial are carried 
over to the next; some are forgotten and 
must be refixated. A curve of learning 
represents a progressively greater balance in 
favor of retention, so that it is, in part, a 
retention curve. . . Fixation and retention 
thus mutually interact in the course of what 
we call learning [McGeoch, 1942, pp. 4-5]. 

A single practice trial, as well as a 
series of trials, thus serves at least 
two functions. It provides the learner 
with the opportunity to study and re- 
tain individual items, but certain events 
occurring on the trial must also be 
responsible for forgetting of some other 
items. The traditional procedure used 
in verbal learning experiments, in 
which the experimenter assumes some
kind of an equivalence of different list 
items and expresses the subject's per- 
formance in terms of the total number 
or proportion of items recalled, per- 
haps simplifies computations, but it 
clouds the two roles played by the 
trial. 
The traditional method for describ-
ing the subject’s performance in a
verbal learning experiment is based on
what might be called the trial-by-trial
analysis of recall. On every trial, the
experimenter partitions the total set
of L items in the list into two mutually
exclusive subsets, P_n and N_n. P_n (Per-
formance) refers to the number of
items that the subject has recalled on
Trial n, while N_n refers to the number
of items that the subject has failed to
recall on that trial. P_n + N_n = L,
hence the specification of P_n completely
determines its complement N_n and
nothing further can be gained from
measuring N_n. The traditional learn-
ing curve is simply the plot of the size
of the subset P_n against trials.

The trial-by-trial analysis of recall 
is to be contrasted with what will here 
be referred to as the trial-to-trial 
(TTT) analysis. This analysis is an 
extension of that used in Estes’ (1960) 
“miniature experiments” to the typical 
multitrial experiment. In the TTT 
analysis, the Set L is partitioned with 
respect to pairs, triplets, etc., of suc- 
cessive trials. In the present paper, 
however, we shall be concerned with 
the TTT analysis as applied only to 
successive pairs of trials. 

The subject’s recall on two succes- 
sive trials, n - 1 and n, defines within 
the Set L two subsets, P_{n-1} and P_n, 
thus placing each element of the Set 
L (each word in the list) into one of 
four mutually exclusive subsets. These 
four subsets are shown in the Venn 
diagram in Figure 1. The large circle 
represents the Set L, the small circle 
on the left represents the subset P_{n-1}, 
consisting of items that the subject re- 
called on Trial n - 1, and the small 
circle on the right represents subset 
P_n, consisting of items that the subject 
recalled on Trial n. 

FIG. 1. Venn diagram representing four 
components of performance derived from the 
application of the trial-to-trial analysis to 
consecutive pairs of trials. 

The four mutually exclusive subsets 
shown in the diagram are designated 
as follows: 

C_{n-1}C_n (or simply CC) is the inter- 
section of the subsets P_{n-1} and P_n. It 
consists of items that occur in the 
subject’s recall both on Trial n - 1 
and on Trial n. The size of this subset 
can be regarded as an estimate of 
intertrial retention. 

N_{n-1}C_n (NC) consists of items that 
occur in the subject’s recall on Trial n, 
but not on Trial n - 1. Its size pro- 
vides an estimate of intratrial retention. 

C_{n-1}N_n (CN) consists of items re- 
called on Trial n - 1, but not recalled 
(“forgotten”) on Trial n. Its size can 
be thought of as an estimate of inter- 
trial forgetting. 

N_{n-1}N_n (NN) consists of items 
that the subject fails to recall both on 
Trial n - 1 and on Trial n. Its size 
can be regarded as an estimate of intra- 
trial forgetting. 
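
The four-way partition can be illustrated with a minimal sketch. The function name, the six-word list, and the two recall protocols below are hypothetical and serve only to show how the sizes of CC, NC, CN, and NN follow from the recall on two successive trials.

```python
def ttt_partition(list_items, recalled_prev, recalled_now):
    """Partition list items by recall on Trial n-1 and Trial n.

    Returns a dict of four mutually exclusive sets whose union is the list.
    """
    items = set(list_items)
    prev = set(recalled_prev) & items  # P_{n-1}
    now = set(recalled_now) & items    # P_n
    return {
        "CC": prev & now,           # recalled on both trials
        "NC": now - prev,           # recalled on Trial n only
        "CN": prev - now,           # recalled on Trial n-1 only
        "NN": items - prev - now,   # recalled on neither trial
    }


# Hypothetical six-word list and two recall protocols.
words = ["accent", "barrack", "center", "drumlin", "entry", "finding"]
parts = ttt_partition(
    words,
    recalled_prev=["accent", "center", "entry"],
    recalled_now=["accent", "entry", "finding", "barrack"],
)
sizes = {name: len(subset) for name, subset in parts.items()}
print(sizes)
```

In this sketch the sizes of CC and NC sum to the number of words recalled on Trial n, and all four sizes sum to the list length.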

The logical definition of the four 
subsets is straightforward, but the 
identification of the subsets with psy- 
chological processes of intratrial and 
intertrial retention and forgetting in- 
volves assumptions that may not be 
quite correct. For instance, referring 
to the CC component of performance 
as an estimate of intertrial retention 
ignores the possibility that an item in 
Subset CC was recalled on Trial n, 
forgotten by the time it was shown to 
the subject again in the input phase of 
Trial n + 1, “relearned” then, and re- 
tained until the output phase of Trial 
n + 1. Thus, both intertrial and intra- 
trial retention may contribute items to 
the CC component which therefore may 
overestimate the amount of material 
that the subject can retain from one 
trial to the next without any interven- 
ing input. For the same reason the 
NC component may underestimate in- 
tratrial retention. Moreover, the ex- 
tent of these errors of estimation may 
vary systematically over trials. These 
relations clearly constitute problems 
for further research. For the time 
being, however, it does not seem com- 
pletely unreasonable to regard the four 
measures—CC, NC, CN, and NN—as 
estimates of intertrial and intratrial 
retention and forgetting as designated 
above. 

If the TTT analysis is extended over 
all consecutive pairs of trials (Trials 
0 and 1, 1 and 2, 2 and 3, ..., N - 1 
and N, where N refers to the total 
number of trials), and if the size of 
each of the four subsets is plotted 
against trials, four curves are obtained: 
an intertrial retention curve, an intra- 
trial retention curve, an intertrial for- 
getting curve, and an intratrial for- 
getting curve. 

These four curves contain all the 
information that is available in the 
trial-by-trial “learning curve,” but they 
also lay bare the anatomy of such a 
curve. Any two of the four curves 
can be mathematically derived from 
the other two, even though for a given 
pair of trials only one of the four meas- 
ures is redundant with the other three. 
This fact reflects the complementarity 
of the definitions of the two processes, 
retention and forgetting: whatever the 
subject does not retain, he has for- 
gotten, and vice versa. It also means 
that the traditional learning curve, in 
which P is plotted against trials, can 
be described in terms of any two of 
the four components. 
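
The derivation of two curves from the other two can be sketched as follows, assuming that nothing is recalled on the hypothetical Trial 0. Given the CC and NC curves and the list length, performance follows from their sum, CN from the previous performance minus CC, and NN from the remainder of the list. The function name and the three-trial curves are hypothetical.

```python
def derive_forgetting_curves(cc, nc, list_length):
    """Derive P, CN and NN curves from CC and NC curves.

    cc[i] and nc[i] are subset sizes for Trial n = i + 1.
    Assumes P_0 = 0 (nothing is recalled before the first input).
    """
    p, cn, nn = [], [], []
    p_prev = 0
    for cc_n, nc_n in zip(cc, nc):
        p_n = cc_n + nc_n                     # performance on Trial n
        cn_n = p_prev - cc_n                  # recalled on n-1, lost on n
        nn_n = list_length - p_prev - nc_n    # recalled on neither trial
        p.append(p_n)
        cn.append(cn_n)
        nn.append(nn_n)
        p_prev = p_n
    return p, cn, nn


# Hypothetical curves for a 22-word list over three trials.
cc_curve = [0, 3, 6]
nc_curve = [5, 4, 3]
p_curve, cn_curve, nn_curve = derive_forgetting_curves(cc_curve, nc_curve, 22)
print(p_curve, cn_curve, nn_curve)
```

On every trial the four derived sizes add up to the list length, which is the redundancy referred to in the text.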

In this paper, we shall be mainly 
concerned with the following relation: 


P_n = C_{n-1}C_n + N_{n-1}C_n [1] 


Equation 1 is a simple mathematical 
statement to the effect that perform- 
ance is an additive function of two 
components, CC (intertrial retention) 
and NC (intratrial retention). Per- 
formance, either on a single trial or 
over a number of successive trials, can 
be analyzed into the CC and NC com- 
ponents, and these two components, 
again either on a single trial or over 
successive trials, can be synthesized 
into overall performance. 


EMPIRICAL SYNTHESIS AND MATHE- 
MATICAL ANALYSIS OF THE FREE 
RECALL LEARNING CURVE 


Let us next examine some data from 
a simple FR experiment and demon- 
strate, first, how the learning curve 
can be derived from the empirically 
determined measures of CC and NC, 
and second, how the two components 
can be estimated from the overall per- 
formance data.⁴ 

⁴ Thanks are due to Albert S. Bregman, 
who collected these data and wrote the elec- 
tronic computer programs for analyzing them. 

Thirty-two summer school students 
served as subjects in the experiment. 
Each subject learned a list of 22 words 
on 22 trials by the method of FR. 
Words were presented to the subject 
by means of a Gerbrands memory 
drum, at the rate of one word per 
second. Subjects recalled words by 
speaking them out aloud. Their re- 
sponses were recorded on tape and 
later transcribed. No definite amount 
of time was given for recall in the 
output phase; when the subject had 
been silent for 10 seconds, the input 
phase of the next trial was begun. 
There were four different, but for- 
mally equivalent stimulus lists. Each 
list was learned by eight different sub- 
jects. The lists are shown in Table 1. 
The words were selected from the 
Thorndike-Lorge (1944) word book, 
according to the following criteria: 
(a) All words were two-syllable nouns, 
or words that could be used as nouns. 
(b) No two words in the same list 
began with the same letter. (c) In 
each list, the words were distrib- 
uted approximately equally among 
four frequency-of-occurrence catego- 
ries. These four categories were: 
words occurring more than 100 times 
per million, those occurring 14 to 16 
times per million, those occurring 4 
times per million, and those occurring 
4 times per 18 million. Apart from 
these restrictions, the words were as- 
signed to different lists in a haphazard 
fashion. 


TABLE 1 
LISTS OF WORDS USED IN THE EXPERIMENT 

accent    amice     action    answer 
barrack   bridle    bandage   buyer 
center    canard    country   cherub 
drumlin   daddy     dipper    despot 
entry     express   effort    ether 
finding   flower    farrow    fasces 
garden    gable     gambler   gorget 
hoyden    hormone   hamlet    hermit 
issue     impact    island    journal 
jungle    juror     kitchen   letter 
kernel    kitty     legion    mantel 
lagoon    lactose   miser     natron 
maxim     midden    noodle    orphan 
newell    novice    octroi    person 
office    ocean     pilgrim   question 
pomade    quarter   quinsy    rennin 
quillet   rumor     rennet    satin 
relique   trochee   stamen    tempest 
surtout   union     trollop   umbra 
treason   virgin    voter     vulture 
valley    wafer     waiver    windrow 
walker    zither    zenith    xylem 

The order of words within a given 
list varied systematically from trial to 
trial. Twenty-two different orders 
were constructed according to the 
method used in a previous paper (Tul- 
ving, 1962a). Each word appeared 
in each serial position just once over 
the total block of 22 trials, and was 
preceded and followed immediately, as 
well as by lags of 1, 2, 3, etc., by every 
other word in the list just once. This 
arrangement provides for complete ab- 
sence of second-order sequential con- 
straint among items in input lists over 
the block of 22 trials, and it mini- 
mizes all higher-order redundancies. 
The sequence in which different input 
orders of a given list were presented 
to subjects on successive trials was 
also systematically varied. Every sub- 
ject received the input orders on suc- 
cessive trials in a different sequence. 

The sizes of each of the four subsets 
defined by the TTT analysis were 
determined for all 22 pairs of trials 
(Trials 0 and 1, 1 and 2, 2 and 3,... 
21 and 22), separately for each subject. 
The mean sizes of the four subsets, 
based on data from all 32 subjects, 
were then calculated. In Figure 2, 
these mean sizes of the four subsets 
are plotted against trials, yielding four 
different curves. 

The intertrial retention (CC) curve 
is the only one that rises systematically 
over trials. It is a monotonically in- 
creasing negatively accelerated curve. 
The intratrial retention (NC) curve 
is an approximately linear curve with 
a negative slope. The intertrial for- 
getting (CN) curve is obviously non- 
monotonic: it rises for the first few 
trials and then, apart from the pre- 
sumably “chance” deflection upward 
at Trial 8, remains practically parallel 
with the abscissa. The intratrial for- 
getting (NN) curve is a monotonically 
decreasing curve with a negatively ac- 
celerated slope. 

Both the theoretical and actual val- 
ues for CC and CN on Trial 1 are 0, 
since subjects can neither retain nor 
forget any items from Trial 0 to Trial 
1. These values are not shown in 
Figure 2. The value of NC on Trial 
1, on the other hand, is determinable, 
and so is the value of NN. N_0C_1 cor- 
responds to the performance score on 
Trial 1: N_0C_1 = P_1 and N_0N_1 = L 
- N_0C_1. 

On inspection of the data it appeared 
that, as a first approximation, the in- 
tertrial retention (CC) curve could be 
described as a linear function in log n. 
The relation appears in Figure 3. The 
ordinate shows the mean number of 
items in the CC category in the recall 
of 32 subjects; the abscissa refers to 
trials on a logarithmic scale. 

FIG. 2. Mean sizes of subsets correspond- 
ing to four different components of per- 
formance derived from trial-to-trial analysis 
of recall data. 

FIG. 3. Intertrial retention as a function 
of trials. (Trials on the abscissa are pre- 
sented on the logarithmic scale. The ordi- 
nate represents the mean number of words 
recalled on two consecutive trials, n - 1 and 
n. The straight line was fitted to the data 
by the method of least squares.) 

The least-squares method yielded the 
following equation for CC as a func- 
tion of trials:⁵ 


C_{n-1}C_n = 11.29 log n - .02 [2] 


This equation is represented by the 
straight line drawn in Figure 3. The 
fit is obviously very good. It is inter- 
esting to note that the curve intersects 
the ordinate corresponding to n = 1 at 
a CC value of 0 as it should, since, 
by definition, C_0C_1 must be zero. An 
important implication of this fact is 
that one could, for the data in the 
present sample, predict the whole CC 
curve rather accurately from a single 
point, the CC score on Trial 2, by 
drawing a straight line through the 
zero intersect and that point. 

⁵ All equations fitted to the data can be 
considered to hold only within the limits of 
the number of trials used in the experiments. 
The CC function, for instance, cannot rise 
indefinitely, since the length of the list will 
impose a limit on the subjects’ recall. This 
limit, of course, does not characterize the 
subjects’ information-processing ability but 
only the experimental conditions under which 
this ability is permitted to operate (cf. 
Waugh, 1962a). 

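
The single-point prediction can be sketched in a few lines. A straight line on a logarithmic abscissa that passes through zero at Trial 1 and through the CC score on Trial 2 has the form CC(n) = CC(2) log n / log 2. The function name and the Trial 2 score of 3.4 are hypothetical illustrations, not values taken from the data.

```python
import math


def predict_cc(cc_trial2, n):
    """Predict CC on Trial n from the CC score on Trial 2.

    Assumes CC is linear in log n and equals zero on Trial 1.
    The ratio of logarithms makes the result independent of the base.
    """
    return cc_trial2 * math.log(n) / math.log(2)


# Hypothetical CC score on Trial 2.
cc2 = 3.4
predicted = [predict_cc(cc2, n) for n in (1, 2, 4, 8, 16)]
print([round(value, 2) for value in predicted])
```

Under these assumptions each doubling of the trial number adds the same increment, CC(2), to the predicted intertrial retention.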
Intratrial retention (NC) is shown 
as a function of trials in Figure 4. As 
a first approximation, for the data in 
this particular sample, the relation be- 
tween mean NC and trials can be 
represented by a linear function with 
a slight negative slope. The least- 
squares method yielded the following 
equation for NC as a function of trials: 


N_{n-1}C_n = 4.87 - .10n [3] 


The next step was to add algebrai- 
cally, on the basis of the definition of 
performance given in Equation 1, the 
intertrial and intratrial retention func- 
tions to obtain the equation of the 
trial-by-trial learning curve. The ex- 
pression on the right-hand side in 
Equation 4 is the algebraic sum of 
corresponding expressions in Equa- 
tions 2 and 3: 


P = 11.29 log n - .10n + 4.85 [4] 


FIG. 4. Intratrial retention as a function 
of trials. (The ordinate, mean N_{n-1}C_n, 
represents mean number of words recalled 
on Trial n but not Trial n - 1; the abscissa 
shows trials, n. The straight line was fitted 
to the data by the method of least squares.) 
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We have now “synthesized” the 
learning curve from two empirically 
determined components of perform- 
ance, intertrial and intratrial retention. 
How well does such a curve fit the 
performance data? Equation 4 is 
graphically depicted in Figure 5, to- 
gether with mean performance data on 
22 trials. The fit appears good. The 
standard error of estimate was 
found to be .246. 

Let us now turn to the problem of 
the mathematical analysis of the learn- 
ing curve into its two component func- 
tions, without going through the rela- 
tively tedious operation of counting 
items in the CC and NC categories in 
the subjects’ recall protocols. The 
mathematical analysis is, in a sense, a 
complementary operation to the syn- 
thesis of the learning curve from the 
empirically determined CC and NC 
functions. 

The logarithmic function of the gen- 
eral form, 

P = a log n + bn + c [5] 

suggested by trial-to-trial analysis, is 
a statement to the effect that P is a 
linear function of two variables, log n 
and n. It is identical with the multi- 
ple regression equation of the form, 
Y = aX_1 + bX_2 + c. Its parameters, 
a, b, and c, can be estimated from the 
product-moment correlations among P, 
log n, and n in the same manner as 
the constants in the multiple regres- 
sion equation. 
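
A minimal sketch of such an estimation is given below. It solves the least-squares normal equations directly rather than working from the correlation matrix, which yields the same estimates. Logarithms are assumed to be to the base 10, and the "true" parameters and noise-free scores are hypothetical values chosen only to show that the procedure recovers them.

```python
import math


def solve_linear_system(matrix, rhs):
    """Solve matrix * x = rhs by Gaussian elimination with partial pivoting."""
    size = len(rhs)
    m = [row[:] + [value] for row, value in zip(matrix, rhs)]
    for col in range(size):
        pivot = max(range(col, size), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, size):
            factor = m[r][col] / m[col][col]
            for c in range(col, size + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * size
    for i in range(size - 1, -1, -1):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, size))
        x[i] = (m[i][size] - tail) / m[i][i]
    return x


def fit_log_linear(scores):
    """Least-squares a, b, c in P = a*log10(n) + b*n + c for n = 1..len(scores)."""
    rows = [[math.log10(n), float(n), 1.0] for n in range(1, len(scores) + 1)]
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xty = [sum(r[i] * y for r, y in zip(rows, scores)) for i in range(3)]
    return solve_linear_system(xtx, xty)


# Hypothetical parameters and noise-free mean scores over 22 trials.
true_a, true_b, true_c = 11.0, -0.1, 5.0
mean_scores = [true_a * math.log10(n) + true_b * n + true_c for n in range(1, 23)]
est_a, est_b, est_c = fit_log_linear(mean_scores)
print(round(est_a, 3), round(est_b, 3), round(est_c, 3))
```

With real data the scores would contain sampling error, and the estimates would differ somewhat from those obtained by fitting the CC and NC components separately.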

Such a multiple regression analysis 
of the mean performance data over 22 
trials in the experiment, based on the 
method of least squares, yielded the 
following equation of the learning 
curve: 


P = 10.70 log n - .073n + 8.07 [6] 


Equation 6 is to be compared with 
Equation 4, in which the parameters 
were estimated from the empirically 
determined CC and NC components of 
performance. The agreement between 
the two sets of values, while not per- 
fect, is close enough to suggest that 
they represent the same population 
parameters and that the differences are 
attributable to errors of measurement. 
In fact, it can be shown that with in- 
finitely large samples in which sam- 
pling error is eliminated, the two pro- 
cedures, TTT analysis and regression 
analysis, would yield identical results, 
provided of course that the forms of 
the functions correspond to those in 
the present sample, linear in log n 
for CC and linear in n for NC.⁶ 

FIG. 5. The logarithmic learning curve. 
(The smooth curve represents the function 
obtained by the algebraic addition of the 
intertrial and intratrial retention functions 
shown in Figures 3 and 4. The data points 
represent mean number of words recalled by 
32 subjects. The ordinate shows mean 
performance; the abscissa shows trials, n.) 

⁶ While it can be argued that the main 
value of equations fitted to learning curves 
lies in new insights they generate, rather 
than in their ability to describe the data, it 
is not without interest that the logarithmic 
curves do seem to fit the FR data quite well. 
The multiple regression method was used to 
generate learning curves, of the form given 
by Equation 5, for five other sets of FR data 
collected at Toronto. The goodness of fit of 
these curves was then compared with that 
provided by exponential functions of the 
form P = a - be^{-cn} (Murdock, 1960). In 
each case, the standard error of estimate was 
much lower for the logarithmic than the ex- 
ponential curve, the difference being pri- 
marily attributable to the superior ability of 
the logarithmic curve to describe data from 
the first few trials. 


INTRATRIAL RETENTION AND PRACTICE 


We have seen that both logically and 
empirically performance can be shown 
to be an additive function of two com- 
ponents, intratrial and intertrial re- 
tention. For the data from our dem- 
onstration experiment, intratrial reten- 
tion decreased as a function of trials, 
while intertrial retention increased. 
Since the rate of increase in intertrial 
retention exceeds the rate of decrease 
in intratrial retention, total recall goes 
up over trials. 

In evaluating the intratrial retention 
data we must keep in mind a possible 
artifact that might be responsible for 
the observed decrements in NC. As 
the subject’s intertrial retention in- 
creases, there remain fewer and fewer 
items in the list that the subject could 
retain from the input phase of a given 
trial. This would be particularly true 
for short lists that are mastered rela- 
tively quickly. It is possible, there- 
fore, that the negative slope of the 
mean NC function, whenever it oc- 
curs, is attributable to the combined 
effects of limited list length and rapid 
learning by some subjects, and that it 
does not reflect the relation between 
practice and intratrial retention faith- 
fully. 

Three observations bearing on this 
issue should be mentioned. The first 
concerns the NC functions for “fast” 
and “slow” learners in the experiment 
described above. The equation of the 
NC function for the subgroup of 16 
fast learners (those with the mean P 
scores above the group median) turned 
out to be N_{n-1}C_n = 5.4 - .18n, while 
for the 16 slow learners it was 
N_{n-1}C_n = 4.4 - .03n. Thus for those 
subjects whose performance is presum- 
ably less affected by the list length, the 
NC function is almost flat. 

The second source of evidence sup- 
porting the hypothesis that the negative 
slope of the NC function represents an 
artifact lies in the findings of an un- 
published experiment conducted at To- 
ronto. In this experiment, 24 subjects 
learned three successive lists of 52 
words each by the method of FR. 
The list words covered a wide range 
of Thorndike-Lorge (1944) frequen- 
cies. They were presented to the sub- 
jects on eight trials, at the rate of one 
second per word. The subjects had 
104 seconds at the end of each input 
phase to recall as many words as they 
could. Their oral recall was recorded 
on tape and later transcribed. 

In this experiment, no subject ever 
recalled more than 37 items correctly 
on any single trial, and mean P on the 
last trial, Trial 8, was only 26.2. The 
NC data from this experiment, even 
when averaged over all subjects, should 
thus be relatively free from any arti- 
factual constraints. These NC data, 
pooled for the 24 subjects over all 
three lists, are shown in Figure 6. 

The slope of the function is still 
negative, but it is smaller than that 
in Figure 4, namely —.05. More im- 
portant, the negative slope is largely 
produced by the mean NC value on the 
very first trial. There seems to be 
little systematic change in NC over 
Trials 2 to 8. The finding that NC 
on Trial 1 is higher than on all subse- 
quent trials has occurred in all sam- 
ples of data that we have examined to 
date, and eventually we will have to 
come to grips with the theoretical prob- 
lems that this finding entails. For the 
time being, however, we can conclude 
that for long lists intratrial retention 
seems to be essentially constant over 
trials. 

FIG. 6. Intratrial retention as a function 
of trials for the experiment involving lists 
of 52 words. (The ordinate represents mean 
number of words recalled on Trial n but not 
on Trial n - 1. The straight line was fitted 
to the data by the method of least squares.) 

The third observation relevant to 
the hypothesis comes from an interest- 
ing experiment reported by Murdock 
and Babick (1961). These investiga- 
tors presented a “critical word” (CW) 
to the subjects on a number of succes- 
sive trials, but always in the context 
of different lists. They found that the 
probability of recall of CW was not 
influenced by the frequency of pre- 
vious presentations. This probability 
can be regarded as a measure of intra- 
trial retention, estimated under condi- 
tions where list length does not impose 
any artifactual limits on recall. Since 
it did not change as a function of trials, 
we can conclude that intratrial reten- 
tion is independent of practice. 

It must be noted that Murdock and 
Babick examined only the probability 
of initial recall of their critical words 
while our NC measure includes both 
initial recalls and re-recalls of pre- 
viously recalled and then forgotten 
items. The comparability of the two 
sets of data, therefore, may seem ques- 
tionable. But the fact that the number 
of items in the NC category—particu- 
larly for longer lists that are not 
mastered very rapidly—remains nearly 
constant over trials suggests that recall 
on Trial n of items not recalled on 
Trial n — 1 is independent of the fre- 
quency of previous recalls of these 
items and that in this sense original 
recalls are quite comparable to re- 
recalls. In a typical FR experiment 
the number of initial recalls per trial 
usually decreases as practice proceeds, 
and the number of re-recalls increases. 
The total number of all items in the 
NC category, however, remains prac- 
tically invariant at all stages of prac- 
tice. 

At first glance the conclusion that 
recall of items on Trial n following 
nonrecall on Trial n - 1 is independent 
of frequency of previous recalls may 
appear at variance with the well-known 
fact that the probability of recall of an 
item is a direct function of the number 
of times that it has been recalled be- 
fore (e.g., Miller & McGill, 1952; 
Underwood, 1954). Furthermore, 
Waugh and Smith (1962) have shown 
that the probability of recall of an 
item on Trial n following its nonrecall 
on Trial n—1 does depend on the 
frequency of previous recalls. 

The contradiction, however, may be 
more apparent than real. The proba- 
bility of occurrence of events of a given 
class may increase, but if, at the same 
time, the size of that class decreases, 
the frequency of occurrence of these 
events may not change at all, or even 
change in the opposite direction. For 
instance, an item that has been re- 
called, say, on eight successive trials, 
and then forgotten on the ninth trial, 
may be recalled again on the tenth 
trial with a very high probability, but 
if recall on the ninth trial depends on 
the frequency of recall on the previous 
eight trials, there will be very few 
items that are forgotten on the ninth 
trial. In short, the apparent contra- 
diction between Waugh and Smith's 
conclusion and the hypothesis that in- 
tratrial retention is independent of fre- 
quency of previous recalls remains a 
contradiction only as long as we con- 
fuse relative frequency, that is proba- 
bility, with absolute frequency of oc- 
currence. 
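
The distinction can be made concrete with a small worked example. The expected absolute frequency of re-recalls is the product of the size of the class of forgotten items and the probability that such an item is recalled again. The function name and all figures below are hypothetical; they show only that a rising probability and a shrinking class can leave the absolute frequency unchanged.

```python
def expected_re_recalls(n_forgotten, p_re_recall):
    """Expected number of forgotten items recalled again on the next trial."""
    return n_forgotten * p_re_recall


# Hypothetical early stage: many items forgotten, modest re-recall probability.
early = expected_re_recalls(n_forgotten=8, p_re_recall=0.5)

# Hypothetical late stage: fewer items forgotten, higher re-recall probability.
late = expected_re_recalls(n_forgotten=5, p_re_recall=0.8)

print(early, late)
```

In this illustration the probability of re-recall rises from .5 to .8 while the expected number of re-recalls per trial stays the same.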

In the light of these observations, 
then, it seems reasonable enough to 
entertain the hypothesis that in an 
ideal FR experiment, in which the 
subject’s performance is determined 
only by the capacity of his memory, 
intratrial retention is independent of 
practice. The possibility, mentioned 
earlier, that the NC component under- 
estimates intratrial retention and that 
intratrial retention may in fact in- 
crease over trials, however, cannot yet 
be completely ruled out. 


INTERTRIAL RETENTION AND SUBJEC- 
TIVE ORGANIZATION 


There is relatively little that needs 
to be said about intratrial retention in 
this paper. We have argued, in the 
light of empirical data, that intratrial 
retention remains essentially invariant 
over trials, and invariances of nature 
do not require explanations in the 
same sense as do variances. There 
are, however, several implications that 
the alleged invariance in intratrial re- 
tention has for the theory of FR learn- 
ing. One such is provided by the fact 
that if intratrial retention is independ- 
ent of trials, it must also be independent 
of intertrial retention. Regardless of 
the number of items that the average 
subject remembers from his previous 
recall, he always remembers the same 
number of “new” items from the im- 
mediately preceding input, where new 
items are those that did not occur in 
the immediately preceding output. Re- 
call of the “old” does not seem to 
interfere with recall of the new. The 
exact significance of this and other im- 
plications may become clearer when 
we understand the processes involved 
in intratrial and intertrial retention. 
For the present we can only conclude 
that the limitation found in immediate 
memory (Miller’s magical number 
seven, 1956b) also seems to apply to 
memory for new items on later trials. 
In this last section of the paper we 
return to the problem of improvement 
in recall over trials. In the light of 
the preceding analysis this has now 
become the problem of increments in 
intertrial retention over trials. Even 
though the TTT analysis has contrib- 
uted to the conceptual clarification of 
the nature of the traditional learning 
curve, it cannot illuminate the proc- 
esses responsible for various compo- 
nents of recall performance. Some 
insight into the process of intertrial 
retention, however, can be derived 
from certain additional findings from 
the demonstration experiment described 
earlier in this paper. These findings 
pertain to subjective organization. 
Subjects do not only recall more 
and more items from the input list as 
practice proceeds in the FR experi- 
ment. They also impose an increas- 
ingly tighter sequential organization on 
the recalled material. It is this or- 
ganization, absent in input lists and 
present in output lists, that is referred 
to as subjective organization (SO). 
The concept has been discussed and a 
method for its measurement has been 
described in a previous paper (Tul- 
ving, 1962a). In that paper it was 
also argued that trial-by-trial incre- 
ments in performance are functionally 
dependent upon organization. 

FIG. 7. Mean subjective organization 
(SO) scores on successive blocks of three 
trials for 32 experimental and 32 statistical 
subjects. 

This argument can now be further 
evaluated by examining the relations 
between SO on the one hand and the 
components of performance that have 
emerged from the TTT analysis, intra- 
trial and intertrial retention, on the 
other hand. In the demonstration ex- 
periment, SO (Lag 0) was calculated 
for each of 32 subjects on successive 
blocks of three trials (Trials 2-4, 5-7, 
. . . 20-22), as well as on blocks of 
seven trials (Trials 2-8, 9-15, 16-22). 

Figure 7 shows the mean SO (Blocks 
= 3) for seven successive blocks of 
trials. It also shows mean SO scores 
for 32 statistical subjects, whose per- 
formance was matched with that of the 
experimental subjects, subject by sub- 
ject and trial by trial. It can be read- 
ily seen that SO increases lawfully 
over trials for the real subjects, but 
not for the statistical subjects. 

Figure 8 shows two parallel effects 
of practice, increasing intertrial reten- 
tion and increasing SO. Mean CC 
scores are plotted against blocks of 
three trials, together with a logarith- 
mic transformation of the mean SO 
scores. The correspondence between 
the two curves is very good, the 
product-moment correlation for the 
seven pairs of mean scores being 
+.996. The important fact, of course, 
is that both variables increase mono- 
tonically over trials; the close corre- 
spondence between the two curves in 
Figure 8 depends on the transforma- 
tion of SO scores and on the appro- 
priate placing of the values on the 
two ordinates. 

FIG. 8. Mean intertrial retention (CC) 
scores and the logarithmic transformation 
of mean subjective organization (SO) 
scores for blocks of three trials. (The left 
hand ordinate represents intertrial retention 
scores, the right hand ordinate shows the 
values of log 100 [mean SO - .100].) 

Does this high degree of correlation 
between SO and CC still occur when 
trials are held constant and the corre- 
lation coefficient is computed for sub- 
jects? Two variables that covary for 
experimental conditions need not co- 
vary for subjects (Mandler, 1959). 
To examine the interrelations among 
different response variables available 
from the experiment, product-moment 
correlations of various measures were 
calculated for all 32 subjects. For 
each subject, mean P, mean CC, mean 
NC, and log SO (Lag 0) were cal- 
culated for three successive blocks of 
seven trials, omitting Trial 1. These 
data are summarized in Table 2. 
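
The across-subjects computation can be sketched as follows. Each subject contributes one pair of block means, and the product-moment correlation is computed over subjects within a single block of trials. The function name and the five pairs of scores are hypothetical and are not taken from the experiment.

```python
import math


def pearson_r(x, y):
    """Product-moment correlation between two equally long score lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical block means for five subjects (one block of trials).
mean_cc = [4.0, 5.5, 6.0, 7.5, 9.0]
log_so = [0.2, 0.5, 0.4, 0.7, 0.9]

r_cc_so = pearson_r(mean_cc, log_so)
print(round(r_cc_so, 3))
```

Because trials are held constant within a block, a correlation computed in this way reflects covariation among subjects rather than covariation over stages of practice.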

In interpreting these correlation co- 
efficients, we must remember that as 
subjects approach the limit of per- 
formance determined by the list length 
L, their CC scores must necessarily 
become higher and NC scores lower. 
Thus, the negative correlations be- 
tween CC and NC, as well as between 
NC and other measures, on Trials 9-15 
and 16-22, are probably to a large 
extent artifactual. Data from the first 
block of trials may be less biased in 
reflecting the relations among these 
response variables. It should also be 
remembered that correlations between 
P on the one hand and CC and NC on 
the other hand are, in a sense, spurious 
correlations, since CC and NC are 
subscores of P. These correlations, as 
well as some others, are included to 
complete the table. Since P = CC 
+ NC, multiple correlations among P, 
CC, and NC are necessarily unity. 


TABLE 2 

PRODUCT-MOMENT CORRELATION COEFFICIENTS AMONG MEAN P, MEAN CC, MEAN NC, AND 
LOG SO (LAG 0) ON THREE BLOCKS OF SEVEN TRIALS FOR 32 SUBJECTS 

                                    Blocks of trials 
Variables                Trials 2-8   Trials 9-15   Trials 16-22 
Log SO and mean P          +.506        +.722         +.843 
Log SO and mean CC         +.584        +.769         +.862 
Log SO and mean NC         -.075        -.783         -.839 
Mean CC and mean NC        -.018        -.881         -.943 
Mean CC and mean P         +.948        +.986         +.988 
Mean NC and mean P         +.295        -.792         -.882 

Considering the initial block of trials 
only, we note first that the correlation 
between CC and NC is essentially zero. 
Subjects’ “ability” to retain items 
from one trial to the next does not 
seem to be related to their ability to 
retain items within a trial. 

The second observation of interest 
in Table 2 concerns the positive and 
significant correlation between CC and 
log SO, and the absence of any notable 
correlation between NC and log SO, 
in the initial block of seven trials. For 
any given subject, therefore, intratrial 
retention is independent of the amount 
of organization that the subject im- 
poses on the material, while intertrial 
retention seems to be directly related 
to SO. 

At the empirical level, SO refers to 
the subjects’ tendency to recall certain 
items in close temporal contiguity to 
one another. At the conceptual level, 
this tendency can be thought to repre- 
sent the formation and existence of 
higher-order memory units. It is as 
if the list items—all already in the 
memory storage prior to the experi- 
ment, as was argued earlier—are re- 
arranged in the storage in the course 
of trial-by-trial practice. Such rear- 
rangement manifests itself and can be 
described in a variety of ways—devel- 
opment of associations of the type that 
define the associative meaning of a 
word (Deese, 1962); clustering in 
terms of conceptual (Bousfield, 1953; 
Cohen, 1963), associative (Jenkins & 
Russell, 1952), or synonymic (Cofer,
1959) categories; chunking, unitization,
or recoding as envisaged by
Miller (1956a, 1956b); construction of
a plan, or creation of a hierarchical
structure (Miller, Galanter, & Pribram,
1960); employment of various
“mnemonic aids” as described, for instance,
by Balaban (1910) and Bugelski
(1962); ordering of items in recall
according to a previously learned code
such as the alphabet (Tulving, 1962b);
and probably many others. Subjective 
organization is just a general name for 
all of these processes. To the extent 
that the higher-order memory units 
that result from organization are not 
specified experimentally and to the ex- 
tent that the nature of the units is 
determined by the subject’s previous 
experience, it might be justifiable to 
refer to these units as subjective units 
(S units). The functional significance 
of the development of these units lies 
in the increased accessibility of indi- 
vidual items constituting a unit. An 
item can be retrieved on its own 
merits, or through other items in its 
higher-order S unit. 

If we accept the assumption that the 
measure of SO reflects the extent to 
which initially “unrelated” list items 
are formed into higher-order S units, 
it becomes possible to regard the in- 
creasing intertrial retention as reflect- 
ing nothing more or less than the 
increasing size of S units, and to en- 
tertain the hypothesis that the number 
of S units mediated by intertrial reten- 
tion remains essentially constant over 
trials. This hypothesis agrees well 
with Miller’s (1956a, 1956b) conception
of a memory system that is limited
by the number of units that can be 
retrieved in succession without inter- 
vening external instructions, but not 
by the information content of these 
units. There is already some evidence 
that large differences in first-trial free 
recall become very much smaller or
even disappear completely when reten- 
tion is measured in terms of S units 
rather than in terms of grammatically 
designated units such as individual 
words (Tulving & Patkau, 1962). 
The hypothesis proposed here to ac- 
count for the observed intertrial reten- 
tion simply suggests that handling of 
information over successive trials in a 
verbal learning experiment is governed 
by the same laws of invariance as is 
handling of information within a sin- 
gle trial. 


SUMMARY


1. A subject attempting to memorize 
a list of previously integrated items 
under the conditions of a multitrial 
free recall experiment “learns” every 
single item at the time of its presen- 
tation. Because of rapid intratrial for- 
getting only some items can be re- 
called in the output phase of the first 
trial. The number of such items rep- 
resents a measure of intratrial reten- 
tion. 

2. Recall on the second trial, as well 
as on all subsequent trials, can be 
analyzed into two components—intra- 
trial and intertrial retention. Intratrial 
retention refers to retention of items 
from the input phase of the trial; intertrial
retention refers to retention of
items from the output phase of the 
previous trial. 

3. The traditional learning curve 
can be expressed as an additive func- 
tion of intratrial and intertrial reten- 
tion curves plotted against trials. For 
many samples of data, one of which 
was considered in detail in the paper, 
intertrial retention increases as a log- 
arithmic function of trials, whereas 
intratrial retention decreases as a linear 
function of trials. The logarithmic
learning curve of the form
$P = a \log n - bn + c$ not only describes the
recall performance quite adequately, but
also makes explicit the two components
of performance, $a \log n$ representing
intertrial retention and $c - bn$ representing
intratrial retention.

4. Although the intratrial retention 
component of performance has been 
observed to decrease as a function of 
trials, it can be argued, on the basis of 
several kinds of evidence, that this 
decrease is an artifact attributable to 
limited list length used in most experi- 
ments, and that in an unconstrained 
task involving memorization of a list 
of words, intratrial retention is in- 
variant at all stages of practice. 

5. Supported by the finding that 
intertrial retention is positively cor- 
related with the amount of organiza- 
tion that subjects impose on their re- 
call, the hypothesis was advanced that 
the increase in intertrial retention re- 
flects the increase in the size, but not 
the number, of the higher-order sub- 
jective units of material which can be 
retrieved from storage. 
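
The decomposition stated in Point 3 can be sketched numerically. The code below is an illustration only: the function names, the use of natural logarithms, and the noise-free data are assumptions made here, not part of the original analysis. It fits P = a log n - bn + c by ordinary least squares and then splits the fitted curve into its intertrial and intratrial parts.

```python
import math


def solve3(m, v):
    """Solve a 3x3 linear system by Gaussian elimination with partial pivoting."""
    a = [row[:] + [rhs] for row, rhs in zip(m, v)]
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(col + 1, 3):
            f = a[r][col] / a[col][col]
            for k in range(col, 4):
                a[r][k] -= f * a[col][k]
    x = [0.0, 0.0, 0.0]
    for i in (2, 1, 0):  # back substitution
        s = sum(a[i][k] * x[k] for k in range(i + 1, 3))
        x[i] = (a[i][3] - s) / a[i][i]
    return x


def fit_learning_curve(recall):
    """Least-squares fit of P = a*log(n) - b*n + c to recall on trials 1, 2, ..."""
    rows = [(math.log(n), -float(n), 1.0) for n in range(1, len(recall) + 1)]
    # Normal equations (X'X) beta = X'y for the three predictors.
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xty = [sum(r[i] * y for r, y in zip(rows, recall)) for i in range(3)]
    a, b, c = solve3(xtx, xty)
    return a, b, c


def components(n, a, b, c):
    """Return (intertrial, intratrial) parts of predicted recall on trial n."""
    return a * math.log(n), c - b * n


# Hypothetical noise-free recall scores generated from a = 6, b = 0.1, c = 8.
data = [6 * math.log(n) - 0.1 * n + 8 for n in range(1, 23)]
a, b, c = fit_learning_curve(data)
inter, intra = components(10, a, b, c)
```

Because log 1 = 0, the intertrial component vanishes on Trial 1, so that first-trial recall is attributed wholly to intratrial retention, as Point 1 requires. A different logarithm base would change only the scale of a.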


A NOTE ON THE INCONCLUSIVENESS OF ACCEPTING
THE NULL HYPOTHESIS 


WARNER R. WILSON AND HOWARD MILLER¹


University of Hawaii 


When an experimenter predicts exactly the differences to be expected, 
his choice of analysis determines whether acceptance or rejection of 
a null hypothesis will lend support to his theory. He makes a similar 
choice when he decides to study a situation in which his theory pre- 
dicts no differences vs. one in which some differences are predicted. 
It is better to make both choices so that support is based on rejection 
rather than acceptance of a null hypothesis because, in both cases, a 
rejection-support strategy enables one to minimize and quantify the 
danger of accepting an erroneous theory even in imprecise experiments. 


Grant (1962) and Binder (1963) have 
recently discussed the utility of basing 
support for a theory on the acceptance of 
a null hypothesis (Ho) versus basing 
support on the rejection of a Ho. It 
is convenient to refer to the procedure 
where acceptance of a Ho leads to sup- 
port for a theory as acceptance-support 
(a-s), and to the procedure where sup- 
port for a theory comes from rejection 
of a Ho as rejection-support (r-s). Both
writers have contributed to the clarifica- 
tion of the issue, but ambiguity and 
disagreement still remain. 

Before discussing the problem it may 
be desirable to discuss some of the terms 
to be used. This paper follows the lead 
of Grant (1962) in viewing the Ho as 
a supposition to the effect that no differ- 
ence or no relationship exists or that no 
effect is exerted. Seemingly the term 
null hypothesis is often used in this way, 
and certainly Grant (1962) used it in this 
way when he first stated the issue. In
this sense it is a statistical hypothesis
to whose rejection alone a probability
estimate may be attached. As Binder
points out, those of the Yule-Fisher 
school view the investigator as having 
two alternatives: he either rejects the 
Ho or reserves judgment. Those follow- 
ing the lead of Neyman and Pearson 


1 Now enrolled in the School of Medicine 
at Western Reserve University. 


either reject the Ho or accept it.² The
Yule-Fisher school would seem more
compatible with r-s since in r-s one
either rejects the Ho or withholds judgment.
Likewise, the Neyman-Pearson
school seems more compatible with a-s 
since it allows acceptance of the Ho, and, 
in a-s it is the acceptance of the Ho that 
provides support for a theory which pre- 
dicted no difference in the first place. 
However, any reasonable person would 
concede that if the theory predicts no dif- 
ference, the failure to find a significant 
difference gives some comfort to the 
theory, at least relatively speaking. It 
would seem, therefore, that it makes little 
difference which of these views one takes. 

This paper suggests that the issue of 
a-s versus r-s strategies is broader than 
Grant and Binder indicate. It is hoped 
that the additional perspectives presented 
here will add further clarification and 
help provide grounds for the resolution 
of the issue. Grant and Binder discuss 
primarily a strategy decision which must 
be made in relation to the analysis of 
data. This paper points out that a similar
decision must be made in relation to
the collection of data.

In some cases, experimenters employ
theoretical models to predict expected


2 Binder’s (1963) article provides a more
detailed and well-documented discussion of 
these two viewpoints. 
outcomes exactly. Grant and Binder di-
rect most of their attention to situations 
of this kind. They point out that when 
evaluating the correspondence between 
predictions and outcomes, the experi- 
menter has at least two choices: he can 
test the Ho of no difference between pre- 
dicted and actual values, or he can test 
the Ho of no significant correspondence 
between observed and predicted values. 
In the first case, support for the theory 
comes from acceptance of a Ho; in the 
second case, support comes from rejec- 
tion of a Ho. It may be noted that in 
this context the experimenter commits 
himself to an a-s or an r-s strategy when 
he decides how to analyze his data. It 
may be convenient to refer to the two 
types of analyses as acceptance-support- 
analysis (a-s-a) and rejection-support- 
analysis (r-s-a). A typical a-s-a would 
involve showing, perhaps by means of 
t-tests, that a number of observed points
do not deviate significantly from the 
points predicted by a theory and conclud- 
ing that the theory is, therefore, sup- 
ported. An r-s-a of the same data would 
involve showing a significant correspond- 
ence between predicted and observed 
points by means of correlation or analysis 
of variance as discussed by Grant. 
Grant (1962) makes several objections 
to a-s-a: It is seldom, in psychology at 
least, that an experimenter can have 
real faith in the accuracy of his theory. 
Hence, if an experimenter finds no differ- 
ence between theoretical and empirical 
values, that is, if a-s is obtained, this 
result usually merely indicates that the 
experiment was not sufficiently sensitive 
to indicate the imperfections in the theory 
that are almost certainly there. Further, 
the more insensitive the experiment, the 
more likely it is to “support” a theory 
even if the theory is poor. When one 
seeks r-s, however, insensitivity works 
against acceptance of a theory. Further- 
more, it is also difficult to give a satis- 
factory interpretation to a failure to ob- 
tain a-s because, if an experiment is very 
sensitive, it may deny a-s to a theory 
which is really fairly good. When an 
experimenter seeks r-s, however, sensitivity
increases his chances of obtaining
support.
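
Grant's objection can be made concrete with a small numerical sketch. The data below are hypothetical, and the single paired t across points is a deliberately simplified stand-in for an a-s-a; the correlation plays the part of an r-s-a. Neither is offered as the procedure of any of the authors cited.

```python
import math


def paired_t(predicted, observed):
    """t statistic for the Ho that the mean deviation (observed - predicted) is zero."""
    d = [o - p for p, o in zip(predicted, observed)]
    n = len(d)
    mean = sum(d) / n
    var = sum((x - mean) ** 2 for x in d) / (n - 1)
    return mean / math.sqrt(var / n)


def pearson_r(xs, ys):
    """Product-moment correlation between predicted and observed points."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical predictions of a poor theory and the values actually observed.
predicted = [1, 2, 3, 4, 5, 6]
observed = [4, 1, 6, 3, 5, 2]

t = paired_t(predicted, observed)   # deviations average out, so t is zero
r = pearson_r(predicted, observed)  # yet predicted and observed barely covary
```

Here the deviations cancel, the Ho of no difference is comfortably accepted, and an a-s-a would count the theory as supported, although the near-zero correlation shows that it predicts essentially nothing. An r-s-a of the same data gives the theory no support.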
Binder (1963, cf. p. 111) claims that 
parallel objections can be made about 
r-s-a. An r-s-a may fail to support a
good theory if the experiment is very 
insensitive; and, at the same time, if the 
experiment is very sensitive, an r-s-a may 
lend support to a worthless theory, that 
is, one that has negligible predictive 
utility. Grant's arguments, says Binder, 
do not add up to an objection against 
an a-s strategy; they add up to an 
objection against experiments that are 
either too insensitive or too sensitive.
Experiments whose sensitivity is appro- 
priate to the present stage of theory con- 
struction and application will enable an 
experimenter to accept helpful theories 
and reject useless ones no matter which 
type of analysis he applies. 
While this last point may be sound 
enough logically, it does not help much 
practically, since it would seem to be 
rather difficult to establish any workable 
formula for deciding what is too precise 
or imprecise in a given setting. Better 
to have a strategy that protects against 
the more serious type of error even if 
optimal sensitivity is not maintained.
Binder (1963) does make a contribu- 
tion, however, in that he brings to our 
attention that we can make several differ- 
ent kinds of mistakes. We can (a) ac- 
cept a false theory; (b) withhold judg- 
ment about or reject a good theory; or 
(c) accept a “true but poor” theory (as 
discussed by Binder, 1963, pp. 111-112). 
It may be helpful to note the relation 
among choice of a-s-a or r-s-a, use of 
sensitive or insensitive experiments, and 
the type of error one is subject to: 
In an imprecise experiment a-s-a leads to Error Type a

In a precise experiment a-s-a leads to Error Type b

In an imprecise experiment r-s-a leads to Error Type b

In a precise experiment r-s-a leads to Error Type c


The choice between a-s-a and r-s-a would 
seem to hinge on an evaluation of the 
seriousness of these different mistakes.
Traditionally, Type a is considered much
more serious than Type b. Campbell
(1959) has presented a very sophisticated 
justification of this time-honored assump- 
tion. Ordinarily experimenters pay no 
attention to Type c; most experimenters 
are content to base r-s on a significant 
t or F and to worry not at all if the 
effect is so small as to be negligible. It 
is apparently quite rare in psychology for 
an experimenter to insist that a differ- 
ence be larger than some minimum before 
it is accepted as important. And, many 
psychological theories are able to predict 
only that differences, of unspecified mag- 
nitude, should or should not occur. Some 
even argue that small effects may be very 
important from the viewpoint of theory 
development and maintain, in effect, that 
when one reports a significant but very 
small effect he is not making an error at 
all but being highly virtuous. Nonethe- 
less, it may be unwise to give so little 
heed to the size of effects, that is, to 
Error Type c. It could be argued that 
psychologists may be spending a great 
deal of their time studying variables bet- 
ter viewed as irrelevant. This would 
seem to be more a fault of interpretation 
of a detected relationship than its detec- 
tion itself. Finding a relationship is still 
a virtue while misusing it may not be. 
One could even argue that Error Type 
a, acceptance of a false theory, is much 
the same as Error Type c, acceptance of 
a theory that is true as far as it goes but 
which is able to predict only a negligible 
part of the variance in the data. Indeed, 
Binder presents much of his defense of 
a-s on the fact that while a-s may lead 
to a, r-s has a corresponding weakness 
in that it may lead to c. Even if it is 
granted that Error Types a and c are 
similar in their consequences—and this 
is a concession that psychology as a 
whole apparently does not make—one can 
still note grounds for preferring precise 
experiments and an r-s analysis to im- 
precise experiments and a-s analysis. For 
one thing, human nature being what it 
is, more experimenters will probably be 
tempted to do imprecise experiments than 
precise experiments. One might argue, 
however, that this consideration is beside
the point. More important is the
fact that the precise experiment neces- 
sarily provides data to indicate how ade- 
quate the theory in question is. To be 
sure, it is a peculiarity of psychological 
data reporting that significance of rela- 
tionship is always indicated while degree 
of relationship usually is not. Nonethe- 
less, if the degree of relationship is 
slight, this can readily be inferred if the 
appropriate computations, or even inspec- 
tions, are made. The experimenter may 
accept a theory that predicts very little 
but he is not likely to be seriously deluded 
about how much it predicts. One is much 
more in the dark with the imprecise a-s-a, 
for in this case the data show that the 
theory may be perfect, but, in the absence 
of r-s-a it may, as far as anyone can tell, 
also be very poor or even completely 
worthless. 

These various considerations would 
seem to support Grant’s contention that 
the usual acceptance-rejection tactics 
might well be replaced or at least supple- 
mented by a quality control procedure 
which would indicate not merely if theo- 
ries predict but how well. The above 
considerations would not, however, seem 
to support Binder’s contention that a-s 
and r-s expose the experimenter to completely
parallel and equally serious pitfalls.
Indeed, in light of the various stated rea- 
sons for viewing Error Type a as more 
serious than Error Type c (which may 
not even in itself be an error), it would 
seem that an evaluation of the various 
considerations Binder raises would lead 
to a preference for an r-s rather than an 
a-s strategy. 

Perhaps some different perspectives 
can clarify further the basis for this 
preference. Suppose the same set of data 
provides both types of support or neither. 
In such cases it would make little differ- 
ence which analysis was applied. But 
suppose the data provide r-s but not a-s. 
Such data can be meaningfully inter- 
preted as lending the theory some sup- 
port as Grant (1962) has pointed out. 
The critical question is, “What are the 
implications of accepting a-s when r-s
cannot be claimed?” It is suggested that
a-s without r-s means nothing. This result
can seemingly only occur when the
experiment is too insensitive to justify 
any conclusion. Although an a-s analy- 
sis may sometimes be a useful or interest- 
ing supplement to the r-s analysis, Grant 
(1962) seems justified in maintaining 
that a-s alone is not enough. To state 
this conclusion in another way: a-s-a is 
adequate in some situations, but in all
these cases r-s-a will be adequate also;
however, in certain other situations,
which apparently cannot always be an- 
ticipated, r-s-a will be adequate but a-s-a 
will lead one astray. Hence, why use 
a-s-a? 

Thus far this paper has considered only 
cases in which the experimenter is com- 
paring the fit between data points and 
theoretical expectations that take the form 
of exact quantitative predictions, e.g., the
correspondence between an empirical and 
predicted learning curve. It is hopefully 
clear that two different approaches can 
be taken to the analysis of such data, the 
acceptance-support-analysis or the rejec- 
tion-support-analysis. 

Obviously many, if not most, experi- 
ments in psychology do not involve exact 
predictions. Often it is only the directionality
of results that is predicted, e.g.,
the experimental group is expected to be 
less than, greater than, or perhaps equal 
to the control group. Do the issues 
raised by Grant and Binder in the con- 
text of exact quantitative prediction have 
any relevance to experiments that test 
theories which predict only whether 
groups are less than, greater than, or 
equal to each other? It is suggested that 
this second context also allows the experimenter
to adopt an a-s or an r-s
strategy and that the arguments favoring 
r-s are just as compelling in this situa- 
tion. 

It is true that when prediction is not 
exact, the kind of analysis applied will 
almost necessarily be analogous to what 
has been discussed as r-s-a. Suppose one 
predicts that the experimental group will 
exceed the control group. The Ho states 
no difference. Support for the prediction 
comes from rejecting this Ho. Applica- 
tion of an a-s-a, in this context, would 
involve conceptualizing the Ho as follows:
“The groups do not differ significantly
from the rank order predicted.”
A-s is obtained unless the groups differ 
significantly in the direction opposite that 
predicted. One, of course, is not likely
to think of an a-s-a in this particular 
context because, in this context, a-s is 
ridiculously easy to obtain. Perhaps, 
however, this fact only lends support to 
the assertion made earlier—a-s without 
r-s means nothing. 

Even when predictions are only direc- 
tional and an r-s-a is used, the experi- 
menter can still adopt an acceptance ver- 
sus a rejection support strategy by choos- 
ing to design his experiment so that his 
theory predicts no differences versus some 
differences. It will be convenient to refer 
to the no-difference experiment as an
a-s-design (a-s-d) and to the some-difference
experiment as an r-s-design
(r-s-d). If the experimenter chooses an
a-s-d, he can claim support for his theory 
if no differences are found and at the 
same time present his negative results 
as evidence against any rival theories that 
do predict a difference. Rock and others 
(Rock, 1957; Rock & Heimer, 1959; 
Rock & Steinfeld, 1963) have used just 
such a strategy. If the experimenter 
wishes to use an r-s strategy, he adopts 
an r-s-d; and if differences are obtained, 
he claims support for his position and 
evidence against any theories that pre- 
dict no difference. Wilson (1962) has 
used just such a strategy. The experi- 
menter may, of course, change strategies 
in midstream. For example, if an experi- 
menter fails to get r-s, he may adopt a 
new theoretical position that predicts no 
difference and cite his data as giving a-s 
to it. When Thorndike reformulated his 
law of effect on the basis of negative 
results, he essentially changed from an 
r-s to an a-s strategy. Seemingly, an 
experimenter could, in theory, start out 
without a theory and formulate one post 
hoc, in which case he might claim either 
a-s or r-s.

The objections made to a-s-a seem to 
apply with equal or greater force to 
a-s-d. In both cases imprecision increases 
the danger of accepting an erroneous 
theory, whereas both r-s-a and r-s-d protect
against erroneous conclusions even
in imprecise experiments. 

A further important and compelling 
point would seem to be that in all of 
these cases we are using statistics because 
empirical hypotheses do not find absolute 
acceptance (or rejection). Rather they 
are accepted with more or less confidence 
depending on the adequacy of the evi- 
dence. Indeed, the main point of inferential
statistics would seem to be to
attach probability statements to the acceptance
of ideas. Unless we are able
to predict a necessary degree of corre- 
spondence or difference (which is rare 
in psychology), we are only able to at- 
tach such a probability statement to the 
rejection of the null hypothesis. For 
example, in a directional empirical pre- 
diction we can say that 1 or 5% of the 
time (as we choose) we will be wrong 
in rejecting the null hypothesis on the 
basis of such data as these. We cannot, 
however, conversely say what the proba- 
bility is of falsely accepting it except for 
the unlikely event in which we can 
specify a necessary magnitude. There- 
fore, only an r-s strategy will supply an 
exact estimate and an a-s strategy allows 
one to “accept” his hypothesis only in 
the negative sense of having found no 
evidence against it. This does not seem 
to be very satisfying grounds especially 
when error variance is large. 
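
The asymmetry just described can be shown with a short calculation. The sketch below assumes a two-sided z test on a single mean with known standard deviation, a setting chosen here only because its error probabilities have a closed form; the function name and the effect sizes are illustrative.

```python
from math import sqrt
from statistics import NormalDist


def prob_accept_h0(effect, n, alpha=0.05):
    """Probability that a two-sided z test on a mean fails to reject the Ho.

    effect: true mean in standard-deviation units (0.0 means the Ho is true).
    n: number of observations; the standard deviation is assumed known.
    """
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha / 2)
    shift = effect * sqrt(n)
    return std_normal.cdf(z_crit - shift) - std_normal.cdf(-z_crit - shift)


# The risk of wrongly rejecting a true Ho is fixed in advance at alpha.
alpha_check = 1 - prob_accept_h0(0.0, 20)

# The risk of wrongly accepting a false Ho depends on the unknown true effect.
betas = {d: prob_accept_h0(d, 20) for d in (0.1, 0.3, 0.5, 0.8)}
```

The probability of falsely rejecting the Ho is set by the experimenter, whereas the probability of falsely accepting it takes a different value for every effect size one cares to assume, and it is largest for small effects. Without a specified magnitude there is no single figure to report.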

A final consideration has to do with 
the nontheoretical value of the informa- 
tion obtained from an a-s-d versus an 
r-s-d. It is perhaps not undue to ask of 
an experimenter what value his informa- 
tion has other than its implications for 
choosing between theories. It may be 
noted that a-s-d seem to commit one to 
the cataloging of ineffective variables and 
procedures. Very likely a cataloging of
effective variables and procedures, which 
is the logical result of r-s-d, would be 
more useful. 
SUMMARY


A choice can be made between an 
acceptance-support and a rejection-sup- 
port strategy when prediction is merely 
directional as well as when it is exact. 
When exact quantitative predictions are 
derived, one chooses an acceptance-sup- 
port versus a rejection-support strategy 
by choosing an acceptance-support analy- 
sis versus a rejection-support analysis. 
When only the directionality of the outcome
is predicted, a rejection-support
analysis is almost sure to be used, but one
can still make the choice of an acceptance-support
versus a rejection-support
strategy by choosing an acceptance-support
design versus a rejection-support
design. In both cases rejection-support
seems to be the better strategy, especially
in that it enables one to minimize and
quantify the danger of accepting an erroneous
theory even in imprecise experiments.


A COMPARISON OF DATA OBTAINED WITH
DIFFERENT FALSE-ALARM RATES¹


DONALD A. NORMAN 
Harvard University 


This note examines the comparison of performance measures when both 
successes (hits) and failures (false alarms) must be considered. A simple 
graphical procedure is developed that allows comparison to be made 
among experimental conditions with very limited assumptions about the 


underlying processes. 


One recurring analytical problem is 
the comparison of performance measures 
when both successes (hits) and failures 
(false alarms) must be considered. An 
example of such a comparison occurs in 
the analysis of a choice experiment with, 
let us say, two possible responses and two 
stimulus conditions. If Response 1 is 
correct only for Stimulus A and Re- 
sponse 2 only for Stimulus B, then the 
results of the experiment are summarized 
by the relative frequencies of correct 1
responses (hits), $p(1 \mid A)$, and false 1
responses (false alarms), $p(1 \mid B)$. The
problem is to compare the results from
Experimental Condition I,
$(\hat{p}_{I}(1 \mid A), \hat{p}_{I}(1 \mid B))$,
with the results from a second
Condition II,
$(\hat{p}_{II}(1 \mid A), \hat{p}_{II}(1 \mid B))$, in
order to determine which condition gives
the superior performance.

This problem arises in many contexts, 
for example: 


1. In recognition memory, the probability
of recognizing old items from a
list of old and new items is compared
for several different experimental variables
(Davis, Sutherland, & Judd, 1961;
Woodworth & Schlosberg, 1954);

2. In psychophysics, the performance 
of an observer detecting (or identifying) 
a signal is compared for several different 
experimental variables (Green, 1960); 

3. In information retrieval, several 
library search procedures are to be 
evaluated and compared when the differ- 


1 This work was performed while the author 
was a National Science Foundation Post- 
doctoral Fellow at the Harvard Center for 
Cognitive Studies. I would like to thank 
Wayne Wickelgren for his helpful suggestions. 
ent systems have differing correct and
incorrect retrieval rates (Swets, 1963). 


Early attempts to make this type 
of comparison assumed that errors 
[$\hat{p}(1 \mid B)$] and [$\hat{p}(2 \mid A)$] were either to
be avoided or to be corrected (Wood- 
worth & Schlosberg, 1954). Two types 
of correction formulas have been popular: 


“true rate” = (proportion correct)
- (proportion incorrect) [1]

or

“true rate” = $[\hat{p}(1 \mid A) - \hat{p}(1 \mid B)] / [1 - \hat{p}(1 \mid B)]$. [2]


These two methods are not only differ- 
ent, but are also conflicting; it is quite 
easy to find pairs of numbers such that 
by using Equation 1, Condition I is 
“better,” while by using Equation 2, 
Condition II is “better.” These two 
correction techniques, the assumptions 
underlying them, and clear demonstra- 
tions of their incorrectness in recognition 
memory, audition, and vision have been 
studied by a number of investigators, 
among them Egan (1958) and Swets, 
Tanner, and Birdsall (1961). 
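
A pair of numbers on which the two corrections disagree is easy to construct. The sketch below reads Equation 1 as the hit rate minus the false-alarm rate and takes Equation 2 to be the familiar chance correction, the same difference divided by one minus the false-alarm rate; both readings, and the two hypothetical conditions, are assumptions of this illustration.

```python
def true_rate_eq1(hit, false_alarm):
    """Equation 1, read here as hit rate minus false-alarm rate."""
    return hit - false_alarm


def true_rate_eq2(hit, false_alarm):
    """Equation 2, read here as (hit - false alarm) / (1 - false alarm)."""
    return (hit - false_alarm) / (1.0 - false_alarm)


# Hypothetical (hit rate, false-alarm rate) pairs for the two conditions.
cond_1 = (0.90, 0.50)
cond_2 = (0.60, 0.10)

eq1_prefers = "I" if true_rate_eq1(*cond_1) > true_rate_eq1(*cond_2) else "II"
eq2_prefers = "I" if true_rate_eq2(*cond_1) > true_rate_eq2(*cond_2) else "II"
```

For these values Equation 1 ranks Condition II above Condition I, while Equation 2 reverses the order, so the verdict depends entirely on which correction is adopted.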

Recent developments in statistical 
decision theory and psychophysics have 
provided an answer to the comparison 
problem when the complete operating 
function (ROC curve) relating $\hat{p}(1 \mid A)$
to $\hat{p}(1 \mid B)$ is known (see the references
in Swets, Tanner, & Birdsall, 1961). 
Unfortunately, it is often impossible or 
impractical to determine this relation. 
In some fields of study there are mathe- 
matical models for these functions, but 
unfortunately, whenever there is one
model there are usually three or four 
models, all different, but all equally 
good descriptions of the same data. 


BASIC ANALYSIS


The purpose of this note is to examine 
some comparisons that can be made 
among the data with very limited theo- 
retical assumptions. To do this, first 
consider what we mean by one condition 
being better than another. There are 
two requirements that seem reasonable. 
First is that a condition (I) is superior 
to another (II) if it leads to a hit rate 
that is higher, and a false alarm rate that
is lower, than that of the other condition. 
The second requirement is that any 
differences in the performance between 
the two conditions could not have oc- 
curred by chance alone. 

Figure 1 shows the unit square representing
the possible hit/false-alarm pairs.
The minor diagonal (the dashed line) 
represents chance behavior—points be- 
low the diagonal are worse than chance 
behavior, and so are clearly inferior to 
points above the diagonal.² Now, let us


2 Although points below the line of chance 
behavior are usually considered inferior to 
those above the line, they still represent 
efficient discrimination. For example, were 
fix the point from Condition I (hereafter
referred to as I) in some arbitrary location
and consider the regions in the unit
square in which a second point (II) 
must lie in order to be unequivocally 
superior or inferior to Point I. By the 
first of our two requirements, Point II 
is inferior if it lies in the rectangular 
space below and to the right of Point I. 
By the same argument, if Point II lies 
anywhere in the rectangular space above 
and to the left of Point I, then II must 
be superior to I. These regions are 
illustrated in Figure 1. 

Now consider the ways in which the 
location of the points fluctuates due to
chance (or any behavior such as guessing 
that is independent of the conditions 
under study). If the number of “1” 
responses increases by chance, completely 
independent of the stimulus conditions, 
then Point I moves to the new Location 
I’ with coordinates 


$\hat{p}_{I'}(1 \mid A) = \hat{p}_{I}(1 \mid A) + t[1 - \hat{p}_{I}(1 \mid A)]$

and

$\hat{p}_{I'}(1 \mid B) = \hat{p}_{I}(1 \mid B) + t[1 - \hat{p}_{I}(1 \mid B)]$, [3]


where $0 \le t \le 1$ is the proportion of the
increase. Equation 3 indicates that
Point I’ moves along the straight line
segment connecting Point I with the coordinates
(1, 1) as $t$ varies from zero to
one. Similarly, if by chance the number 
of “2” responses should increase by some 
proportion u, completely independent of 
the stimulus conditions, then Point I 
moves to the new Location I” with 
coordinates 


$\hat{p}_{I''}(1 \mid A) = u\,\hat{p}_{I}(1 \mid A)$

$\hat{p}_{I''}(1 \mid B) = u\,\hat{p}_{I}(1 \mid B)$, [4]


where $0 \le u \le 1$. Equation 4 determines
the straight line segment connecting
Point I with the coordinates (0, 0).
Now, if Point II should lie either 
along the lines of Equations 3 and 4 


the subject’s responses always wrong, the
obtained point would be at the lower right
corner (1, 0), signifying excellent (although
somewhat perverse) performance.
or below and to the right of these lines,
it cannot be superior or equivalent to 
Condition I by our second requirement: 
the same or better result could have been 
obtained from Condition I by chance 
alone. So we may add this area to the 
region labeled “inferior” in Figure 1.
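
The chance movements of Equations 3 and 4 can be illustrated with a short sketch. It assumes that a point is written as (pr(1|B), pr(1|A)), horizontal coordinate first, which is inferred from the footnote placing always-wrong performance at (1, 0); the function names are hypothetical and Equation 4 is implemented as printed, with both coordinates multiplied by u.

```python
def shift_toward_ones(point, t):
    """Equation 3: a proportion t of the remaining responses become "1" by chance."""
    if not 0.0 <= t <= 1.0:
        raise ValueError("t must lie in [0, 1]")
    p_b, p_a = point  # assumed order: (pr(1|B), pr(1|A))
    return (p_b + t * (1.0 - p_b), p_a + t * (1.0 - p_a))


def scale_toward_origin(point, u):
    """Equation 4 as printed: both coordinates are multiplied by u."""
    if not 0.0 <= u <= 1.0:
        raise ValueError("u must lie in [0, 1]")
    p_b, p_a = point
    return (u * p_b, u * p_a)


def is_collinear(p, q, r, tol=1e-12):
    """True if the three points lie on one straight line (zero cross product)."""
    cross = (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0])
    return abs(cross) <= tol


point_i = (0.2, 0.6)
moved_up = shift_toward_ones(point_i, 0.5)      # lies between I and (1, 1)
moved_down = scale_toward_origin(point_i, 0.5)  # lies between I and (0, 0)
```

Every shifted point stays on the straight line through Point I and the corresponding corner of the unit square, which is the property the argument relies on.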

By a similar argument, Point II is 
superior to I if it is located in the unit 
square so that Point I lies within the 
area enclosed by the lines connecting II 
with (0,0) and (1, 1). It is easily seen 
that the region in which II will be 
superior to I is defined by the area above 
and to the left of the continuations of 
the line segments of Equations 3 and 4 
(see Figure 1). Note that the argument 
is entirely equivalent to requiring that 
the operating characteristic be mono- 
tonic from (0, 0) to (1,1) with non- 
increasing slope. At any Point I, the 
slope of the operating characteristic is 
bounded by the slopes of the line con- 
necting I with (0,0) (as a maximum) 
and the line connecting I with (1, 1) 
(as a minimum). The two points cannot
possibly lie on the same curve unless 
they both lie in the region bounded by 
the lines. 

As we can see in Figure 1, we can 
make some good inferences on the rela- 
tionship between two experimental con- 
ditions without any strong assumptions 
about the underlying processes. The 
unshaded areas in Figure 1 are the 
ambiguous regions where, in the absence 
of more data or theories, no conclusion 
can be reached. 
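
The regions of Figure 1 can be expressed as a small decision rule. The following is a sketch only, under stated assumptions: points are written as (pr(1|B), pr(1|A)), both points lie on or above the chance diagonal, and the names `reachable_by_chance` and `classify` are hypothetical. A point counts as reachable when it lies on or below both chance lines through the other point (the lines to (0, 0) and to (1, 1)).

```python
def reachable_by_chance(p, q, tol=1e-12):
    """True if q lies on or below both chance lines through p.

    The lines join p to (0, 0) and p to (1, 1). Cross products are used so
    that no division by zero occurs at the edges of the unit square.
    """
    px, py = p
    qx, qy = q
    below_origin_line = px * qy - py * qx <= tol
    below_unit_line = (1.0 - px) * (qy - py) - (1.0 - py) * (qx - px) <= tol
    return below_origin_line and below_unit_line


def classify(point_i, point_ii):
    """Classify Point II relative to Point I: inferior, superior, or ambiguous."""
    if point_i == point_ii:
        return "identical"
    if reachable_by_chance(point_i, point_ii):
        return "inferior"   # II could have arisen from Condition I plus chance
    if reachable_by_chance(point_ii, point_i):
        return "superior"   # I could have arisen from Condition II plus chance
    return "ambiguous"      # the unshaded regions of Figure 1
```

For example, with Point I at (0.2, 0.6), a second point at (0.4, 0.5) falls in the inferior region, one at (0.1, 0.8) in the superior region, and one at (0.5, 0.85) in an ambiguous region.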
Much stronger restrictions on the unit
square can be determined if a second
point is available for Condition I. With
two points, I₁ and I₂, the regions where
Condition II is inferior or superior to
Condition I are obtained by the disjunction
of the corresponding areas determined
separately for each point. (If,
indeed, the two points come from the 
same performance condition, there should 
be no conflicting regions.) This new area 
is illustrated in Figure 2 by the diagonal 
shading. 

FIG. 2. Comparison of data with two
points from one condition. (The shaded areas
illustrate where Condition II is superior or
inferior to Condition I.)


If both Points I₁ and I₂ can result
from Condition I, then intermediate
points can also be obtained if performance
alternates between the two points.
In this case, any point on the straight
line segment connecting I₁ and I₂ can
be obtained in Condition I. Thus, if
Point II were located in the triangle
bounded on the upper left by the Line
I₁-I₂, it must be considered inferior to
I. The addition of the second point, I₂,
reduces the ambiguous region in the 
unit square considerably, as shown in 
Figure 2. Note that if two points are 
available for both conditions, not only 
does Figure 2 apply but the analytical 
power is strengthened. The analysis of 
Figure 2 should be applied twice, once 
for the pair of points from each condi- 
tion. The results are ambiguous only if 
all the points from both analyses are 
ambiguous. 
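
The two-point analysis of Figure 2 can be sketched in the same way. This is an illustration under assumptions, not a definitive procedure: points are written as (pr(1|B), pr(1|A)), all points lie on or above the chance diagonal, the two Condition I points are mutually consistent, and the function names are hypothetical. The single-point regions are combined by disjunction, and the triangle under the segment joining I₁ and I₂ is added to the inferior region.

```python
def reachable_by_chance(p, q, tol=1e-12):
    """True if q lies on or below both chance lines through p."""
    px, py = p
    qx, qy = q
    return (px * qy - py * qx <= tol and
            (1.0 - px) * (qy - py) - (1.0 - py) * (qx - px) <= tol)


def classify_against_pair(i1, i2, ii, tol=1e-12):
    """Classify Point II against two points, I1 and I2, from Condition I."""
    # Disjunction of the single-point inferior regions.
    if reachable_by_chance(i1, ii) or reachable_by_chance(i2, ii):
        return "inferior"
    # Mixtures of I1 and I2 fill the segment joining them, so anything on or
    # below that segment (within its horizontal extent) is also inferior.
    (x1, y1), (x2, y2) = sorted([i1, i2])
    if x1 < x2 and x1 <= ii[0] <= x2:
        if (x2 - x1) * (ii[1] - y1) - (y2 - y1) * (ii[0] - x1) <= tol:
            return "inferior"
    # Disjunction of the single-point superior regions.
    if reachable_by_chance(ii, i1) or reachable_by_chance(ii, i2):
        return "superior"
    return "ambiguous"
```

With I₁ at (0.1, 0.5) and I₂ at (0.4, 0.8), a point at (0.25, 0.62) is ambiguous against either point taken alone but is classed as inferior once the connecting segment is taken into account.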


OTHER CONSIDERATIONS 


The regions illustrated in the figures 
are derived from an exact knowledge of 
the points. These, of course, are seldom 
known; the empirically determined val- 
ues are but statistical estimates of the 
true locations. The confidence intervals 
for the estimates of the two coordinates 
of each point determine confidence 
ellipses which surround the points. The 
confidence limits for the boundaries of
the inferior and superior regions can be 
determined by drawing the lines of 
Figures 1 and 2 tangent to the outer 
edges of the ellipses. These confidence 
limits can then be used to determine the 
effect of statistical variability on the 
comparison of the data points. 

One basic assumption remains to be 
stated. It has been assumed that the 
true underlying operating characteristics,
whatever their shape or location, do
not cross one another. This assumption
is weak for two reasons. One, if the 
operating characteristics do cross, the 
concepts of superior or inferior become 
strained and perhaps even meaningless. 
Two, to my knowledge, none of the 
varied theories or data on operating 
characteristics suggest such a possibility. 


CONCLUSION 


The analyses in this note can easily be 
extended to more data points and more 
stimuli and responses. The important 
result is that experimental conditions 
can often be ranked with respect to 
performance without any assumptions 
about the underlying mechanisms or
artificial correction techniques. Exact 
knowledge of the operating character- 
istic is always the most desirable solu- 
tion and supersedes these techniques. 
Unfortunately, such information is not 
always available. 


REFERENCES 


DAVIS, R., SUTHERLAND, N. S., & JUDD, B. R.
Information content in recognition and
recall. J. exp. Psychol., 1961, 61, 422-429.

EGAN, J. P. Recognition memory and the
operating characteristic. Technical Note,
Contract No. AF 19(604)-1962, 15 June
1958. Bloomington: Indiana University
Hearing and Communication Laboratory.

GREEN, D. M. Psychoacoustics and detection
theory. J. Acoust. Soc. Amer., 1960,
32, 1189-1203.

SWETS, J. A. Information retrieval systems.
Science, 1963, 141, 245-250.

SWETS, J. A., TANNER, W. P., JR., & BIRDSALL,
T. G. Decision processes in perception.
Psychol. Rev., 1961, 68, 301-340.

WOODWORTH, R. S., & SCHLOSBERG, H.
Experimental psychology. New York: Holt,
1954. P. 700.


(Received November 1, 1963) 


VOL. 71, No. 4                              JULY 1964


PSYCHOLOGICAL REVIEW


In experiments dealing with variations in CS intensity in eyelid
conditioning and in signal intensity in simple reaction time, it was
determined that substantially greater effects are obtained if individual
Ss are exposed to the different intensities than if each S experiences
only one intensity value. Different results are thus to be expected
from experimental designs which use these different procedures.
Stimulus intensity generalization experiments are inherently of a
type which should produce substantial intensity effects. The data
tend to support a dynamogenic theory which does not depend upon
fixed functions of stimulus energy, but in which the level of adaptation
must be considered.


¹ This work was supported by Grant
G-14223 from the National Science Foundation.


The intensity of the CS has usually 
been found to be a relatively ineffective 
variable in human conditioning. In 
eyelid conditioning, Carter (1941) and 
Grant and Schneider (1948) found no 
evidence that this variable reliably af- 
fected either response strength or the 
associative level of conditioning. 
Grant and Schneider (1949) also 
failed to find significant CS intensity 
effects in GSR conditioning. The 
only human data cited by Hull (1949) 
in his presentation of the stimulus 
intensity dynamism (V) were studies 
of signal intensity in reaction time 
(Cattell, 1886; Piéron, 1920) and the 
well-known GSR study of intensity 
generalization by Hovland (1937). 
Walker (1960) reported that the in- 
tensity of an auditory CS reliably af- 
fected the frequency of eyelid re- 
sponses of the voluntary form, and 
conditioned responses of short latency 
only. However, the effects which she 
reported were rather small. More re- 
cently, Beck (1963) has obtained a 
result in eyelid conditioning which, in 
view of this earlier literature, is quite 
remarkable. In an investigation of 
possible interaction effects between CS 
intensity, UCS intensity, and drive 
level as inferred from an anxiety scale, 
she included the CS intensity variable 
in a factorial design as a within-Ss 
effect. Two intensity values were used 
and were both administered to all Ss 
in an irregular order throughout 100 
conditioning trials. This was done in 
order to achieve the statistical pre- 
cision of this design, since the effect 
was assumed to be rather weak. The 
result was that the CS intensity vari- 
able not only reached statistical sig- 
nificance, but produced an effect which 
was very large in absolute terms. In 
fact, the difference produced by a 50-
decibel difference in CS intensity was 
approximately the same size as that 
produced by values of .5 and 5.0 psi 
in the strength of the air-puff UCS. 
These represent rather extreme values 
of a variable well known to be highly 
effective in eyelid conditioning experiments.
This large difference was consistent
over all values of the other
two variables. A rather inescapable 
conclusion seems to be that something 
more than statistical precision resulted 
from the within-Ss design. Appar- 
ently, the exposure of the Ss to the 
two intensities of the CS within the 
experiment served to increase the mag- 
nitude of the effect over what is ob- 
tained when S experiences only one 
intensity value. This is the case in 
the usual experiment in which a sepa- 
rate group of Ss is provided for each 
value of the variable under study. 
What is required to establish this
conclusion firmly is to conduct investigations
in which the two methods of
studying intensity functions are directly
compared. This means, in effect,
the comparison of a between-Ss
experiment with a within-Ss experiment in which
the same values of the independent 
variable are used in both. This turns 
out to be a reasonable, but neglected, 
experimental design in psychological 
research. The use of repeated meas- 
ures on the same Ss is demanded by 
some problems, such as the effect of 
increasing amounts of practice on 
learning. In other cases, however, 
within-Ss designs are used to achieve 
statistical precision through elimina- 
tion of individual difference variance 
and to effect economies in the number 
of Ss required. Such use of the design
usually involves, often implicitly,
the assumption that the treatment effects
obtained are the same as would
be obtained with a larger, less economical
random groups design. As
suggested in the case of stimulus intensity,
this is not necessarily so.
Even though the values of an inde- 
pendent variable may be the same, the 
two types of experiment are not inves- 
tigations of the same problem. Be- 
tween- and within-Ss relationships are, 
in principle, independent, and the laws 
obtained may or may not be similar. 
Furthermore, as in the present in- 
stance, the difference between the 
two types of functions may represent 
a problem of considerable psychologi- 
cal interest. Two experiments have 
been conducted investigating the ef- 
fects of stimulus intensity under these 
different conditions. 


EXPERIMENT I 
Method 


The first experiment was an investigation 
of CS intensity in eyelid conditioning. Con- 
ventional recording and stimulus program- 
ing equipment was used (Grice & Davis, 
1960). The Ss were university undergraduate
women. The basic parameters of
the experiment were: a 1-psi air puff as
UCS; a .5-second interstimulus interval; an 
average intertrial interval of 20 seconds; 
and a buzzer warning signal at an average 
of 2 seconds before the CS. The CS was 
a 1000-cycle tone of .5-second duration, 
presented in earphones by means of an 
electronic switch with a 30-millisecond rise 
and decay time. Its intensity was either 
50- or 100-decibel SPL. One group of Ss
(Group L) was conditioned for 100 trials 
with the loud tone only, and a second group 
(S) received the soft tone. Two groups 
(LS-L and LS-S) received 50 conditioning 
trials with each tone. The order of presen- 
tation was random with the restrictions that 
there be no more than 4 successive trials 
with one intensity and that there be 10 
presentations of each tone in successive 
blocks of 20 trials. Two orders were used, 
one the mirror image of the other formed 
by interchanging the loud and soft tones. 
Half of the Ss in each group had one 
order, and the other half, the other. Pro- 
cedure for the two LS groups was identical, 
but for purposes of statistical analysis, the 
loud tone data were used for LS-L and 
the soft tone data for LS-S. There were 
20 Ss in each group.
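
The restrictions on the order of presentation can be illustrated with a short sketch. It is not the procedure actually used to construct the orders, which the text does not describe; it simply generates a 100-trial sequence satisfying the two stated restrictions (10 presentations of each tone in each successive block of 20 trials, and no more than 4 successive trials with one intensity) by repeated shuffling, and forms the mirror-image order by interchanging loud and soft. The function names and the seed are hypothetical.

```python
import random


def max_run(seq):
    """Length of the longest run of identical consecutive items."""
    longest = run = 1
    for prev, cur in zip(seq, seq[1:]):
        run = run + 1 if cur == prev else 1
        longest = max(longest, run)
    return longest


def make_order(n_blocks=5, per_tone=10, max_successive=4, seed=0):
    """Reshuffle balanced blocks until the run-length restriction is met."""
    rng = random.Random(seed)
    while True:
        order = []
        for _ in range(n_blocks):
            block = ["L"] * per_tone + ["S"] * per_tone  # balanced block of 20
            rng.shuffle(block)
            order.extend(block)
        if max_run(order) <= max_successive:
            return order


def mirror(order):
    """Mirror-image order: interchange the loud (L) and soft (S) tones."""
    return ["S" if tone == "L" else "L" for tone in order]


order_a = make_order(seed=1)
order_b = mirror(order_a)
```

Because the mirror image preserves run lengths and block balance, both orders satisfy the same restrictions.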


Results 


The results are presented and ana- 
lyzed in terms of percentage of antici- 
patory CRs. The major findings of 
the experiment are quickly seen from 
an inspection of Figure 1. The points 
connected by the broken line represent 
percent CRs on Trials 41-100 for the 
two groups having only one CS. The 
points connected by the solid line rep- 
resent the loud (LS-L) and soft 
(LS-S) tone data for the two groups 
which received both CS intensities. 
The magnitude of the intensity effect 
is more than five times as great for 
the condition in which Ss received both 
stimuli as compared with the single 
value condition. This interaction is
also significant, F(1, 76) = 8.32, p <
.01. The combined intensity effect is
also significant, F(1, 76) = 18.23, p
< .001, but for the single stimulus
condition alone it is not, t(76) = 0.98.
Other individual group comparisons
are also of interest. For the loud tone,
response frequency is significantly
greater for the two-stimulus condition
than for the one, t(76) = 2.29, p < .05.
The inferiority of the two-stimulus
condition on the soft tone does not
reach significance, t(76) = 1.79, .10 >
p > .05. In Figure 2 are presented
the conditioning curves for the two
single stimulus groups and data for
the loud and soft stimuli for Groups
LS-L and LS-S, respectively. Data
points are means of 20 trials for
Groups L and S and of the appropriate
10 trials for Groups LS-L and
LS-S. Basically, these curves show
that the phenomena illustrated in Figure 1
begin early in training and are
undiminished through 100 conditioning
trials. In summary, the experiment
unambiguously verifies the conclusion
drawn on the basis of the
comparison between Beck's finding and
the previous literature. The exposure
of Ss to two values of CS intensity
during conditioning substantially increases
the effect of the variable as
compared with the use of single values
with separate groups.


FIG. 1. Percent CRs during last 60 trials
to the loud and soft tones under the one-
and two-stimulus conditions.


FIG. 2. Conditioning curve data showing percent CRs to each stimulus under each condition
of administration during successive blocks of 20 conditioning trials.


EXPERIMENT II 


Hull's postulate concerning the 
dynamogenic effects of stimulus inten- 
sity was, in part, based on results 
from studies of signal intensity in 
simple reaction time, although he probably
did not use the best data available,
e.g., Chocholle (1945, 1948). In
these experiments clear and orderly
functions have been obtained. It is
of interest that Cattell’s (1886) data 
cited by Hull, as well as those of 
Chocholle, were obtained from re- 
peated measures on highly practiced 
Ss. Chocholle, for example, would 
run about 20 trials with one intensity 
and then shift to another. In other 
words, both employed within-Ss de- 
signs in which the Ss were exposed to 
all intensities. Similar experiments 
with separate groups of Ss would be 
a tremendous undertaking, but the 
question as to whether they would 
yield similar functions is unanswered. 
Another question of considerable in- 
terest is the effect of using blocks of 
trials of various lengths with a single 
stimulus as opposed to the use of an 
unpredictable order, but this problem 
must await future research. In order 
to investigate the generality of the 
above finding for eyelid conditioning, 
a similar investigation has been con- 
ducted with simple reaction time using 
two intensity values of an auditory 
signal. 


The response consisted of pressing
a conventional telegraph key. Reaction
signals consisted of 1000-cycle
tones of 1-second duration, and of
40- and 100-decibel SPL. The 60-decibel
separation was used rather
than the 50-decibel of the eyelid study
because Chocholle’s data suggested that
a wide separation was necessary in order
to obtain substantial effects, if
near-threshold values were to be
avoided. The foreperiod signal was
a buzzer and preceded the tone by 1,
1.5, and 2 seconds in an irregular
order. Trials were run at approximately
a rate of 4 per minute. There
were 120 trials. The experimental
groups were analogous to those of
Experiment I. Groups L and S always
received one signal intensity.
Groups LS-L and LS-S received half
of the trials at each intensity in
unpredictable order. For Trials 21-120,
one of the random orders for Experiment I
was used throughout.
Conventional reaction time instructions
were given. The Ss were women from
the same population as for Experiment
I. There were 20 Ss each in Groups L
and S, and 10 Ss each in Groups LS-L
and LS-S. Reaction latencies were
recorded in milliseconds by means of
an electronic counter.

The data presented in Figure 3 are 
means of S medians for Trials 21 to 
120. Medians for the single-stimulus 
condition are based on all 100 trials. 
Those for the two-stimulus condition 
are based on the appropriate 50 trials.
It is clear that these findings are not 
entirely the same as those for the eye- 
lid experiment. However, the basic 
finding that the intensity effect is in- 
creased by exposure to both stimuli 
was again obtained. The slope for the 
two-stimulus condition is significantly 
steeper than for the one-stimulus 
condition, F(1, 56)= 4.48, p< .05. 
Rather than the crossover effect found
in the eyelid study, there is an overall
slowing of reaction time in the two-stimulus
condition. This effect is significant,
F(1, 56) = 16.59, p < .001.
This is produced primarily by the response
to the soft tone where the difference
between the two conditions is
significant, t(56) = 4.38, p < .01. The
difference for the loud tone is not significant.
The combined intensity effect
is significant, F(1, 56) = 42.97, p
< .001, and even the smaller effect for
the single stimulus condition alone is
also significant, t(56) = 4.13, p < .01.
Practice curves are presented in Fig- 
ure 4. After an initial improvement, 
the results are quite stable, and there 
is no indication that the interaction 
between intensity and the experimental 
conditions is decreasing with practice. 
Thus, in spite of certain differences in 
detail in these two situations studied, 
both experiments confirm the general 
principle that differential responsive- 
ness on the basis of stimulus intensity 
is increased by multiple stimulus
presentation.


FIG. 3. Means of S median latency on
the last 100 reaction time trials.


FIG. 4. Practice curves for each stimulus under the
2 conditions in the reaction time experiment.


THEORETICAL IMPLICATIONS 


These experimental findings are of 
considerable theoretical interest. It is 
at once apparent that Hull's stimulus 
intensity dynamism postulate will not 
account for them. He assumed that V 
was a simple function of stimulus en- 
ergy, but these experiments clearly 
show that the dynamogenic property 
of a given energy depends largely on 
what other stimuli are presented. It is 
clear, then, that there must be no invariant
intensity functions.

Another hypothesis, still within the 
framework of Hull’s theory, was inde- 
pendently proposed by Perkins (1953) 
and Logan (1954), and has been more 
recently elaborated by Champion 
(1962). These writers have suggested 
that the dynamism concept is super- 
fluous in Hull's system, and that in- 
tensity effects are deducible from other 
portions of the theory. This view is 
essentially a discrimination interpreta- 
tion. Making use of generalization 
gradients for excitation originating at 
the CS intensity value, and gradients 
of inhibition originating at the back- 
ground intensity, they were able to 
deduce greater net excitatory strengths 
for stronger stimuli. There are some 
problems involved in the application of 
this theory to the present situation. If 
we assume, on the basis of the condi- 
tioning curves, that all habit strengths 
are approximately at asymptotic level, 
and hence about equal, the theory pre- 
dicts equal differences between the 
two stimuli in the two situations. 
There is one feature of the eyelid con- 
ditioning data which is particularly 
embarrassing to this theory. The 
theory rather clearly predicts that the 
addition of a weaker stimulus should 
result in a weaker absolute response 
strength to the stronger stimulus. 
This is because the inhibitory gradient 
from the background intensity should 
start at a higher level. This predic- 
tion would hold unless the strong 
stimulus was not within the range of 
this gradient, in which case, its abso- 
lute strength would be unaffected. 
However, in the present eyelid experi- 
ment, the response strength to the 
loud tone was significantly increased 
by the presence in the experiment of 
the soft tone. Furthermore, this oc- 
curred in spite of the fact that it re- 
ceived only half as many reinforced 
trials as in the single-stimulus situation.

These present data tend to suggest 
an interpretation in terms of a genuine 
dynamogenic effect associated with the 
intensity of the eliciting stimulus. Ob- 
viously, however, more than mere 
stimulus intensity is involved. It ap- 
pears that some form of contrast ef- 
fect is an important element in this 
phenomenon. One approach would be 
in terms of an adaptation-level concept 
similar to that of Helson (1959). 
This interpretation is particularly well 
able to handle the result of the condi- 
tioning experiment. For example, 
one could make the simple assumption 
that the dynamogenic potency of a 
stimulus depends upon its departure
from adaptation level (AL) rather
than upon absolute intensity. In the
case of the single stimulus situation,
then, the AL should be near the stimu- 
lus value. This would lead to the pre- 
diction of minimal intensity effects, 
since, irrespective of absolute intensity, 
departure from AL would be small. 
In the two-stimulus situation, how- 
ever, the AL might be expected to lie 
between the two values. With the 
weak stimulus below AL and the 
strong above, an exaggerated intensity 
effect would be expected. The crossed 
functions of the conditioning experi- 
ment lend considerable credibility to 
this interpretation.
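
The adaptation-level argument can be made concrete with a small worked sketch. It rests on a simplifying assumption that is not part of the source: AL is taken as the unweighted mean, in decibels, of the intensities presented, ignoring background and residual stimulation and any weighting that a fuller adaptation-level formulation would include. The function names are hypothetical, and the 50- and 100-decibel values are those of Experiment I.

```python
def adaptation_level(presented_db):
    """Assumed AL: the unweighted mean of the presented intensities (decibels)."""
    return sum(presented_db) / len(presented_db)


def departures_from_al(presented_db):
    """Signed departure of each distinct intensity from the assumed AL."""
    al = adaptation_level(presented_db)
    return {db: db - al for db in sorted(set(presented_db))}


loud_only = departures_from_al([100] * 100)             # Group L
soft_only = departures_from_al([50] * 100)              # Group S
both_tones = departures_from_al([50] * 50 + [100] * 50)  # Groups LS-L and LS-S
```

Under this assumption the departure is zero for either single-stimulus group, whereas in the two-stimulus condition the soft tone falls 25 decibels below AL and the loud tone 25 decibels above it, which is the qualitative pattern the interpretation requires.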

While the reaction time experiment 
is in agreement with the conditioning 
study in producing a greater intensity 
effect for the two-stimulus condition, it 
also differs in that the functions do not 
cross. Rather, there is a general slow- 
ing of reaction time in the two-stimu- 
lus condition, the slowing effect being 
greater for the soft tone. This does 
not fit the adaptation-level interpreta- 
tion quite so neatly. However, it 
seems possible that, in this situation, the 
longer reaction times reflect a set phenomenon
resulting from the unpredictability
of the stimulus. There is little
support for this view in the disjunctive 
reaction time experiments. There, 
the latency seems to depend upon the 
amount of information transmitted 
(Bricker, 1955), and irrelevant in- 
formation in the stimulus which does 
not increase information transmitted 
has no effect, at least at high levels of 
practice (Morin, Forrin, & Archer, 
1961). There seem to be little data, 
however, on this point for simple re- 
action time where there is no response 
selection, and, in this sense, no infor- 
mation transmitted. Mowrer, Ray- 
man, and Bliss (1940) and Mowrer 
(1941) did, under some conditions, 
find that the occurrence of an unex- 
pected (high informational) stimulus 
resulted in a slower reaction even 
though there was no change in the 
response required. The present data 
certainly suggest this, and the matter 
needs further exploration. This inter- 
pretation suggests that the intensity 
functions would cross except for the 
presence of an additive constant due 
to the set variable. The data of Morin, 
Forrin, and Archer (1961) hint that 
the crossing might be obtained with ex- 
tremely extended practice. At least, 
these considerations indicate that the 
reaction time data can very possibly be 
subsumed under the adaptation-level 
interpretation, if the set explanation 
turns out to be correct. 

Whether or not adaptation-level 
theory will provide a fully adequate 
explanation for these phenomena is a 
matter for future decision. At least a 
number of experimental approaches 
are suggested by the findings and by 
the interpretation. Among the ex- 
perimental variables which should be 
of interest are: the number and range 
of intensity values employed; the rela- 
tive frequency of occurrence of each 
intensity used; the length of the trial 
blocks in which a particular value is
consecutively presented; the degree of
predictability in the stimulus presentation
sequence; the mere exposure of
S to certain stimuli without reinforcement
or requiring a response; and the
role of background stimulation. Of
course, experimental situations other 
than eyelid conditioning and reaction 
time should be investigated. The fact 
that variable stimulation leads to 
stronger and more persistent orienting 
reflexes also suggests the possible
relevance of that phenomenon. It is 
the intent that the present discussion 
be limited to situations involving hu- 
man Ss. However, there is the pos- 
sibility that the problem is somewhat 
different with animals and that stimu- 
lus intensity is a more effective vari- 
able with them (e.g., see Kimble, 1961, 
p. 120). If this is true, it may be that
attention-getting properties of strong
stimuli are of more importance with 
animals than in most human experi- 
mentation. Perhaps this will suggest 
additional approaches to the problem. 
A word should be added about the 
problem of response evocation by 
stimulus onset versus stimulus cessa- 
tion. In general, it appears that, for 
a given intensity, it makes little dif- 
ference whether the critical event is 
onset or cessation, so far as response-producing
potency is concerned
(Champion, 1962; Chocholle, 1948;
Logan & Wagner, 1962). While the
discrimination theory of Perkins and
Logan will account for this similarity,
the doubt cast upon this interpretation 
by the present data suggests that the 
dynamogenic property of stimuli de- 
pends upon the amount of energy 
change in relation to Ss’ present con- 
dition of adaptation. The contention of 
Logan and Wagner that a dynamogenic 
theory implies greater effectiveness of 
onset is not a necessary part of such 
a theory. This is an interesting idea, 
though, since it implies that a sudden
decrease in stimulus energy will have 
dynamogenic properties, if this event 
is the condition for response evocation. 
Certainly, the cessation condition re- 
quires exploration with the within-Ss 
type of experiment as well as the onset 
condition. 


STUDIES OF INTENSITY
GENERALIZATION


The present findings have a rather 
interesting implication for studies of 
generalization along intensity dimen- 
sions. Such experiments inherently 
involve the presentation of more than 
one stimulus to individual Ss. It is 
thus to be expected that larger inten- 
sity effects will be found in generaliza- 
tion studies than in independent group 
experiments investigating intensity di- 
rectly. It is apparently no accident 
that Hull relied so heavily on general- 
ization data in his discussion of the 
stimulus intensity dynamism, and his
chief interest in the concept seemed
to be its application to intensity gen- 
eralization functions. It is well known 


TABLE 1

ANALYSIS OF HOVLAND'S (1937) DATA ON
INTENSITY GENERALIZATION

Source                                        F
Between subjects
    CS intensity (CS)                        .56
    Error                                (149.90)
Within subjects
    Test intensity (TI)                    27.74**
    Cycles (Cy)                             1.81
    TI X Cy                                  .86
    TI X CS                                 3.00*
    Cy X CS                                  .50
    TI X Cy X CS                            1.60
    Error                                 (27.25)
    Error (TI, TI X CS)                   (26.22)
    Error (Cy, Cy X CS)                   (67.91)
    Error (TI X Cy, TI X Cy X CS)         (14.21)

Note. Error mean squares in parentheses.
* p < .05.
** p < .001.


that generalization tests are conducted
in different ways. The Ss may be 
tested at only one value, either by a 
single test trial or by a series of ex- 
tinction trials. Either of these meth- 
ods involves a single intensity shift, 
but the extinction method then in- 
volves a series on the new intensity 
without further change. On the other 
hand, each S may be tested at all intensities,
usually in an unpredictable
order, counterbalanced between Ss. 
There may be more than one cycle of 
test trials. There is reason for think- 
ing that these methods may not yield 
the same generalization gradients. 
The problem raised here suggests that 
they may not result in the same inten- 
sity effects either. Data are not avail- 
able for adequate predictions, but a 
likely speculation would be that the 
counterbalancing procedure would 
produce the greatest intensity effect, 
and the extinction procedure the least. 
The experiment by Hovland (1937), 
using the counterbalancing method, 
and that by Hall and Prokasy (1961), 
using a single test trial, produced re- 
liable intensity effects, while the Grant 
and Schneider (1948, 1949) experi- 
ments, using extinction at a single 
value, did not. 

The most widely cited experiment 
using the multiple test procedure is 
that of Hovland (1937). This experi- 
ment is still of considerable interest 
for several reasons. Briefly, Hovland 
conditioned 16 Ss on the GSR at 40- 
decibel loudness level, and a second 
group of 16 at 86 decibels. All Ss 
were then tested by means of extinction
trials at 40, 60, 74, and 86 decibels
in counterbalanced orders. Three 
cycles of test trials were conducted. 
While appropriate methods of analysis 
did not exist at that time, all data were 
published, and a complete analysis is 
presented here. The basic data, aver- 
aged over the three test cycles, are 
presented in Figure 5, and the summary
of the analysis is presented in
Table 1. The large effect of test 
stimulus intensity is apparent in the 
figure and is highly significant. This 
large and reliable intensity effect, as 
opposed to Grant and Schneider's 
(1949) negative finding for the GSR, 
is very likely due to the presentation 
of all stimulus intensities to each S as 
opposed to the Grant and Schneider 
procedure in which each S received 
only one training and one extinction 
intensity. This conclusion must be 
made with some reservation, however, 
since individual difference variance 
was not included in this comparison 
as it was in the Grant and Schneider 
experiment.² The difference in slope
between the two functions of Figure 5 
may be interpreted as indicating true 
generalization decrement and is sig- 
nificant (TI X CS), but it accounts 
for a trivial portion of the variance. 
Hovland pointed out that the generali- 
zation gradient was flat on the first test 
cycle when averaged over training in- 
tensities, and that generalization decre- 
ment only appeared on the second and 
third test cycles. However, it turns 
out that this change in slope was not 
statistically significant (TI × CS ×
Cy).
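
The layout of the Hovland design described above can be made concrete with a short sketch. It is illustrative only: the partition of degrees of freedom assumes a conventional mixed design, with training intensity (TI) varied between Ss and test intensity (CS) and test cycle (Cy) varied within Ss. The function names and the choice of error terms are assumptions of this sketch and are not taken from Table 1.

```python
from itertools import product

# Design constants taken from the description of Hovland (1937).
N_PER_GROUP = 16               # Ss per training-intensity group
TRAINING_DB = (40, 86)         # training intensity (TI), between Ss
TEST_DB = (40, 60, 74, 86)     # test stimulus intensity (CS), within Ss
N_CYCLES = 3                   # test cycles (Cy), within Ss


def design_rows():
    """Return one (subject, TI, CS, Cy) tuple per test trial."""
    rows = []
    for group, ti in enumerate(TRAINING_DB):
        for s in range(N_PER_GROUP):
            subject = group * N_PER_GROUP + s
            for cy, cs in product(range(1, N_CYCLES + 1), TEST_DB):
                rows.append((subject, ti, cs, cy))
    return rows


def degrees_of_freedom():
    """Partition the total df under an assumed mixed-design layout."""
    a, b, c, n = len(TRAINING_DB), len(TEST_DB), N_CYCLES, N_PER_GROUP
    ss = a * (n - 1)  # Ss within training groups
    return {
        "TI": a - 1,
        "Ss within TI": ss,
        "CS": b - 1,
        "TI x CS": (a - 1) * (b - 1),
        "CS x Ss within TI": (b - 1) * ss,
        "Cy": c - 1,
        "TI x Cy": (a - 1) * (c - 1),
        "Cy x Ss within TI": (c - 1) * ss,
        "CS x Cy": (b - 1) * (c - 1),
        "TI x CS x Cy": (a - 1) * (b - 1) * (c - 1),
        "CS x Cy x Ss within TI": (b - 1) * (c - 1) * ss,
    }


if __name__ == "__main__":
    print(len(design_rows()), "test trials in all")
    for source, df in degrees_of_freedom().items():
        print(f"{source:<24} df = {df}")
```

Under these assumptions the TI × CS interaction (the slope difference between the two functions) and the TI × CS × Cy interaction (the change in that difference over cycles) are both within-S effects, each tested against its own interaction with Ss.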

Another feature of the Hovland de- 
sign is that it may be regarded as a 
test of whether the intensity of the 
CS during conditioning is a deter- 
minant of the associative strength of 


In the Hall and Prokasy (1961) experi-
ment, only data for one test trial were
presented. However, eight extinction trials
were actually conducted with a single test
intensity for each S. Unpublished data
kindly provided by Hall (J. F. Hall, per-
sonal communication, 1963) indicate that the
effect due to test stimulus intensity dimin-
ished during extinction. From the second
trial on, the overall intensity relationship
was no longer monotonic, and by the seventh
trial, all traces of the effect had disappeared.

Fig. 5. Test trial data for Hovland's
(1937) intensity generalization experiment.
(Each point is mean of three test trials.)


conditioning. In this respect, it 
is related to the original purpose 
of the Grant and Schneider ex- 
periments, and to the Spence (1953) 
experiment of UCS intensity. This 
experiment, however, involves the 
counterbalanced presentation of four 
intensities to each of two CS intensity 
groups, rather than using a separate 
group for each test intensity value. 
The result is consistent with those of 
Grant and Schneider and also of Hall 
and Prokasy (1961) in indicating no 
significant effect of CS intensity. In 
this connection, it is significant that 
Hull (1951, 1952) regarded the stimu- 
lus intensity dynamism during con-
ditioning (V1) as a determiner of
habit strength. Most of the data do 
not seem to warrant the conclusion 
that CS intensity does influence asso- 
ciative strength. However, it appears 
that while Hull thought of the dyna- 
mogenic properties of test stimuli 
(V2) as dependent on stimulus energy,
he apparently thought of V1 as de-
pendent only upon the age of the stim-
ulus trace. It now appears that the 
dynamogenic properties of stimuli, 
dependent upon intensity and adapta- 
tion level, and the degree of associa- 
tive efficacy, which depends on the 
interstimulus interval, should not be
regarded as manifestations of the
same concept.

THE FUNCTION OF AGGRESSION AND THE
REGULATION OF AGGRESSIVE DRIVE 


SEYMOUR FESHBACH* 
University of Colorado 


Preliminary to the development of a stimulus-mediating response 
model of aggressive drive, formal definitions are given of instrumental 
aggressive acts and aggressive drive-mediated behavior. In defining 
aggressive drive, a distinction is made between expressive aggression 
and hostile aggression which is illustrated in the difference between 
the desire to “hit” and the desire to “hurt.” Hostile aggression is 
assumed to be a learned drive whose primary antecedents are past ex- 
posure to punishment and present threats to self-esteem. Several 
procedures for reducing aggression other than performance of an 
aggressive response are described and the function of diverse meas- 
ures responsive to changes in expressive aggression, aggressive drive, 
and aggressive response strength, particularly when evaluating the
consequences of an aggressive act, are analyzed.


The general problems to which this
paper is addressed are the functional
properties of an aggressive act and the
mechanisms by which aggression is re-
duced. We will be particularly con-
cerned with the following hypothesis
which is central to the theoretical
analysis presented by Dollard, Doob,
Miller, Mowrer, Sears, Ford, Hovland,
and Sollenberger (1939) in the frus-
tration-aggression monograph: “The
expression of an act of aggression is a
catharsis that reduces the instigation
to all other acts of aggression [p. 53].”
There are three terms or phrases in
this hypothesis which require further
explication in order to ascertain its
theoretical implications and the empiri-
cal operations which would be appro-
priate to a test of this hypothesis. We
must establish what constitutes an “act
of aggression,” specify the properties of
an “instigation to . . . aggression,” and
evaluate the function of “catharsis” in
determining the meaning of the above
proposition.


1 The author wishes to express his grati-
tude to Norma Feshbach, Francis Irwin, and
Paul Mussen for their critical reading of the
manuscript and their helpful comments.

An initial draft of this paper was prepared
at Stanford University while the author was
the recipient of a United States Public
Health Research Fellowship.


AGGRESSIVE ACT: INCIDENTAL,
INSTRUMENTAL, AND HOSTILE 


A troublesome problem in the inves- 
tigation of aggressive behavior is the 
definition of the term aggression. The 
most formal attempt at a rigorous defi-
nition of this term has been made
by Buss (1961), but, as has been indi- 
cated elsewhere (Feshbach, 1961), 
important difficulties still remain. 
There is general agreement (Berko- 
witz, 1962; Buss, 1961; Dollard et al., 
1939) that aggressive behavior denotes 
injury to some person or object. Dis- 
agreement arises in regard to the in- 
clusion of “intent” or some other 
motivational construct in the criterion 
for an overt aggressive act. Insistence 
upon exclusion of motivational consid- 
erations in categorizing an injurious 
act as aggressive results in several dif- 
ficulties. In particular, accidental in-
jury and the unintended infliction of
pain (e.g., the immediate pain conse- 
quent to the dentist's injection of novo- 
cain) would be classified as aggressive 
while an intended injurious response 
that fails to achieve its goal would then 
fall into the nonaggressive class. In 
addition, there are innumerable in- 
stances in a competitive society in 
which injury to some party is an in- 
evitable consequence of the norms gov- 
erning the distribution of economic 
and prestige rewards in the society. 
To vote for Candidate A is to injure 
Candidate B although, to be sure, vot- 
ing for Candidate B might result in a 
much greater injury to the population 
at large. Firing an employee who 
needs and wants the position inflicts 
injury upon him although the firing be 
justified by the economic advantages 
to be gained by hiring cheaper or more 
competent labor. The assertion of one’s 
own interests at the cost of a competi- 
tor’s or colleague’s interests results in 
a loss to the competitor regardless of 
the legitimacy of one’s own claim. 
Social conventions attest to the dif- 
ference between competition-produced 
and aggression-instigated injury and 
loss. However, it must be noted this 
discrimination is not easily made and 
it has been observed that many mem- 
bers of our culture apparently general- 
ize the inhibitions evoked by aggres- 
sion-produced cues to cues stemming 
from the assertion of their own self- 
interest. This equivalence is evidenced 
in the several usages of the adjective 
“aggressive” which carries connota- 
tions both of assertiveness and hostility. 

For these reasons, it is useful to 
restrict the term aggression to those 
aggressive acts which are intended, or 
at least distinguish between uninten- 
tional and motivated aggression. Un- 
intentional aggression is defined as an 
act which, although resulting in injury, 
is not contingent upon its injurious
consequences. Intentional aggressive
acts display the formal properties of 
a preference as defined by Irwin in his 
analysis of the relationship between 
discrimination and preference (Irwin, 
1958). Not all intentional aggressive 
acts are functionally equivalent, how- 
ever. Thus the distinction is made be- 
tween instrumental aggression which 
is directed toward the achievement of 
nonaggressive goals, and hostile ag-
gression, or aggressive drive, for which
the goal response is injury to some 
object (Sears, Maccoby, & Levin, 
1957). Irwin's definition of “desire” 
and his demonstration that preference 
is not equivalent to desire is pertinent 
here (Irwin, 1961). Hostile aggres- 
sion is motivated by a desire for a nox- 
ious outcome while the term instru- 
mental aggression shall be restricted to 
describe behaviors producing injury 
which are motivated by a desire for 
some outcome other than injury to an 
object. Such outcomes may include 
reinforcement of motives varying from 
attention-getting to self-preservation. 
There is an increasing tendency in 
contemporary theoretical treatments of 
aggression (Bandura, Ross, & Ross, 
1961, 1962; Buss, 1961) to stress 
instrumental aggression and either dis- 
count the hostile, aggressive drive con- 
cept or relegate it to secondary impor- 
tance. The noxious consequences of 
an aggressive act are viewed as one of 
a large number of possible reinforcers, 
and the analysis of aggression pre- 
sumably need concern itself only with 
the reinforcing stimulus conditions and 
the discriminative stimuli eliciting the 
response. This analysis seems to pro- 
vide a satisfactory account of many 
forms of aggressive interaction in infra- 
human species (Scott, 1958) and has 
been applied with some success to ag- 
gressive behaviors in young children 
and adolescents (Bandura & Walters, 
1959; Buss, 1961). However, it fails
to do justice to the cultural implica-
tions of an aggressive act and particu- 
larly to its interpersonal properties. 
It will be contended here that the in- 
jurious outcomes resulting from in- 
strumental aggression must be taken 
into account when considering the vari- 
ables which influence the aggressor’s 
behavior. 

The prototype of aggressive behav- 
ior in animals and young children is 
fighting and biting. An 18-month-old 
child, when thwarted, may attack the 
frustrating source by biting the object 
or by kicking or hitting it. If the frus- 
tration is eliminated—the child, for 
example, regains a toy that has been 
taken away—then these attacking re- 
sponses are presumably reinforced. On 
the next occasion, then, in which the 
child experiences a similar frustration, 
he is likely to respond in a similar 
manner. But society cannot tolerate 
for very long this simple manner of 
problem solving and the culture re- 
quires the child to make a complex set 
of stimulus and response discrimina- 
tions. On the response side, he quickly 
learns that biting is taboo, that kicking 
is unmanly, and that fighting must be 
carried out solely by the hands, un- 
aided by weapons. If he is a powerful 
youngster, he may have to learn to 
modulate the vigor of his attack. On 
the stimulus side, he may be punished 
for striking his parents, but rewarded 
for fighting his friends (Bandura & 
Walters, 1959). He has to discrimi- 
nate not only between objects toward 
whom he can express aggression with- 
out disapproval or punishment, but 
also the occasions for which an aggres- 
sive response is appropriate. He may 
be taught to defend himself but not to 
initiate an attack. Most difficult of all, 
he must learn to distinguish between 
intentional and unintentional frustra- 
tion and injury. The regulation of this 
complicated repertoire of behaviors is
effected through a variety of rewards
and punishments which the parents 
may sometimes inadvertently adminis- 
ter and indirectly through identifica- 
tion with and imitation of the parents 
and other models. 

Many of the details of this process 
have been traced elsewhere (Bandura 
& Walters, 1963; Buss, 1961; Sears, 
Maccoby, & Levin, 1957). Of par- 
ticular relevance to the present discus- 
sion are features of the regulation of 
aggression which pertain to the imme- 
diate noxious consequences of an in- 
strumental aggressive act. Whether a 
child achieves or fails to realize some 
goal through an aggressive act, his ag- 
gression will be labeled. He will be 
told that he is hurting someone, that 
he is inflicting pain, and he learns to 
associate this label with the reaction of 
the object and with his own aggressive 
response. The individual who crudely 
sweeps others aside in his single- 
minded pursuit of a goal and who re- 
mains ignorant of the destruction and 
havoc he wreaks, is a rare person 
indeed. 

In brief, responses producing the 
cues of “it hurts” in others will lead 
to reward, to nonreward, to punishment,
or to some combination, as the case
may be. The cues can then serve to
mediate that very diverse class of be-
haviors that we label aggressive.2 It
is recognized that physical aggression, 
verbal attacks, humiliation of others, 
rejection, etc., may be differentially 
reinforced. However, to the extent 
that there is functional equivalence 
between these radically different be- 
havioral forms in that preference for 
their use or avoidance in achieving
nonaggressive goals is correlated, their
common consequence is probably an
important element in determining this
set of preferences.


2 The cues of pain in others, as Sears
(1958) has suggested, may become sec-
ondary reinforcers and serve to maintain
aggressive responses in the absence of other
rewards. This possibility will be consid-
ered in the discussion of aggressive drive.


The noxious consequences of an ag-
gressive act may exert still another
kind of influence on the use of instru- 
mental aggression. Early in their de- 
velopmental history, children demon- 
strate affective reactions similar to 
those evoked in someone whom they 
are witnessing (Murphy, 1937). This 
response pattern is especially evident 
when children are seen watching dra- 
matic films. It has also been noted in 
daily interactions with peers. A child 
may become anxious and distressed 
simply by observing an act of aggres- 
sion directed toward another child. 
The first child responds vicariously to 
the experiences of the second. Ob- 
servation by an empathic child of the 
consequences of an aggressive act di- 
rected toward another child should 
tend to elicit distress responses in the 
child observer even if he himself is 
the instigator of the aggressive act. 
One would expect these painful, vicari- 
ous responses to function as inhibitors 
of the child’s own aggressive responses. 
These inhibitory tendencies should oc- 
cur even if the instrumental aggressive 
act is successful in achieving some non- 
aggressive goal. Thus children high 
in empathy should, on the whole, mani- 
fest less overt aggression than children 
low in empathy.4


3 This correlation alone is not sufficient to 
demonstrate a genuine empathy response. 
Sources of pseudo-vicarious instigation have 
been reviewed in detail by Berger (1962). 

4 This straight-forward deduction from the 
assumption of empathic inhibition has yet to 
be directly confirmed. In Lois Murphy's 
(1937) study of sympathy in nursery school 
children, a positive correlation was obtained 
between teachers’ ratings of aggressive be- 
haviors and ratings of sympathetic behaviors 
which are similar in a number of instances
to indexes of empathy. This positive rela-
tionship cannot be taken as strong evidence
against the hypothesis since, as the author
suggests, both aggressive and sympathetic
responses are instances of socially oriented
behavior. The positive correlation in this
age group, then, appears to be largely a func-
tion of the social maturity of the child and
the extent to which he participates in social
interaction.

Although there is a dearth of rele-
vant empirical data, informal, clinical
observations suggest that empathic in-
hibition often plays a significant role
in restraining the infliction of pain on
others. Interesting psychological ques-
tions arise in connection with the ge-
netic antecedents of this process and
with the dynamic, situational factors
which attenuate and exaggerate its po-
tency. In regard to the latter, simi-
larity between the observer and the
victim should facilitate its potency. In
aggressing against another person, it
is convenient to perceive him as some-
one very different and, if at all possi-
ble, to dehumanize him. Shylock’s
famous soliloquy, affirming his mem-
bership in the human species, is an
attempt to undo this differentiation and
thereby reduce aggression directed to-
ward him. The immediacy and inten-
sity of the pain should also be relevant
factors. The discomfort produced by
physical attack is direct and apparent,
the pain produced by rejection is distal
and more complexly determined. The
parameters of empathic inhibition may
help explain why physical punishment
for criminals is frowned upon in mod-
ern penal systems, although other ex-
tremely punitive devices still persist.
In the hedonic calculus employed by
the prisoner, 20 lashes may be prefera-
ble to 10 days of isolation. A further
factor in discouraging the persistent
use of physical punishment is its pos-
sible “brutalizing” influence upon the
punishing agent. While the aggressor
is motivated by empathic responses to
inhibit aggressive acts, if, for other
reasons, he continues to perform these 
aggressive acts, the empathic-produced 
pain may result in the evocation of 
responses which are incompatible with 
the empathy response. When a father 
apologetically tells his son while spank- 
ing him, “this hurts me more than it 
does you,” his self-report may be ve- 
ridical for the initial spankings, but if 
this mode of punishment is frequently 
used, the empathic response is likely 
to be weakened. Again, informal ob- 
servation suggests that continual ex- 
posure to human pain and suffering 
will result in the acquisition of defen-
sive behaviors—e.g., intellectualization
—which are incompatible with the 
empathy experience. 

To summarize, it is proposed that 
even in so-called “instrumental aggres- 
sion,” the noxious consequences of an 
aggressive act influence the behavior 
of the aggressor. Moreover, as the 
psychoanalysts are fond of arguing, 
much of human behavior seems over- 
determined. The boy who strikes his 
younger sibling to get his mother’s at- 
tention may well obtain a small quan- 
tum of satisfaction from his sibling’s 
discomfort as well as the gratification 
attendant upon his mother’s presence 
and interest. The remaining discus- 
sion will be concerned with aggressive 
behaviors for which the noxious con- 
sequences are the primary goal. 


INSTIGATION TO AGGRESSION 


The frustration-aggression hypothe- 
sis asserts that frustration of an on- 
going activity produces an instigation 
whose goal response is injury to some 
person or object. This conceptualiza- 
tion retains most of the dynamic prop- 
erties of the psychoanalytic theory of 
aggression without the limiting and 
unnecessary assumption of an instinc- 
tive drive. The strength of the insti- 
gation is directly linked to the fre-
quency and intensity of frustrating
experiences and, in contrast with an
instinct assumption, data can be ob- 
tained bearing upon the utility of this 
hypothesis. Criticisms of the frustra- 
tion-aggression hypothesis have been 
largely concerned with the invariance 
of this relationship and with the occur- 
rence of nonaggressive responses to 
frustration. Insufficient attention, how-
ever, has been given to the “instiga-
tion” construct and particularly to the
nature of the goal response. Although
Dollard et al. (1939) in their original 
formulation used the concept of insti- 
gation in a rather broad, general sense 
to refer to the antecedent conditions 
correlated with an aggressive response, 
the use of the term “goal response”
suggests a correspondence between the 
notion of drive and instigation. Also, 
the extent to which learning enters 
into the frustration-aggression rela- 
tionship is left ambiguous. It was 
sufficient for the authors’ purpose at
that time to state this hypothesis re- 
gardless of its origins. However, in 
order to derive means of controlling 
and reducing aggressive drive, further 
analysis is required.

In considering the role of learning 
it is helpful to distinguish between 
the rage or anger response and the 
motivated infliction of injury. Sears, 
Maccoby, and Levin (1957) have noted 
that anger “appears to be an instinctive 
response to certain kinds of restraint 
and discomfort” while the desire to 
hurt someone in the process, however, 
is not so clearly instinctive. This dis- 
tinction is essential to the analysis of 
the dynamics of anger and aggression 
that is presented in this paper. All 
emotions have expressive components 
and most emotions appear to have a 
directive component. In the case of 
anger in infants, the expressive com- 
ponent includes changes in facial ap- 
pearance, the emission of loud sounds
and agitated, vigorous movement of
the limbs (Goodenough, 1931). This 
pattern of behavior is assumed to be 
an innate response in the human infant 
to frustration. As the infant acquires 
greater motor control and perceptual 
sensitivity, these external movements 
become less diffuse and more specifi- 
cally directed toward the perceived 
source of frustration. These aggressive 
acts will on occasion have instrumental 
value in removing the frustrating 
agent, and, to the extent that the be- 
havior is rewarded, the innate connec- 
tion between frustration and the anger 
reaction will be strengthened. To 
paraphrase the frustration-aggression 
statement, in this early state of devel- 
opment (from infancy through ap- 
proximately the second year), frustra- 
tion produces an instigation to hit 
rather than to hurt; the topography of 
the response is more important than 
its noxious consequences. Given a 
frustrating agent and a choice of three 
alternatives such that the child can 
respond with a vigorous, aggressive 
motor response with no pain feedback 
from the object, or can inflict injury 
upon the object by a mild motor re- 
sponse such as pulling a lever, or wit- 
ness someone else inflict injury upon 
the object, the 2-year-old child should 
prefer the first alternative. 

The relationship between instrumental,
expressive, and hostile aggression may be
summarized as follows: Using Irwin's
(1961) terminology, all three types reflect
aggressive preferences. Instrumental ag-
gression is motivated by a desire for non-
aggressive outcomes; hostile aggression is
motivated by a desire for aggressive (nox-
ious) outcomes while expressive aggression
is motivated by a desire for a particular
form of responding. Although Irwin's analy-
sis of desire was restricted to response out-
comes, it can also be usefully extended to
response modes providing that the choice of
response mode has some instrumental func-
tion.


The relative neglect of the expres-
sive component of anger in comparison
with its destructive aspects is under- 
standable in view of the prevalence of 
hostility in human relations. How- 
ever, its functional significance was 
discussed in detail by Darwin almost 
a century ago. Darwin (1873) noted 
the threat function of an aggressive 
display and his first principle of “serv- 
iceable associated habits” is concerned 
with emotional expression or remnants 
of action that once were of direct use 
to the organism. Of more interest to 
us is his third principle of actions due 
to the constitution of the nervous sys- 
tem. Here he views emotional ex- 
pression as an overflow of energy. 
The excited motor activity of the en- 
raged animal serves as a means of ten- 
sion regulation and discharge. There 
are striking points of similarity be- 
tween Darwin’s third principle and 
Freud’s Pleasure Principle. It will be 
recalled that Freud postulated that 
instinctual impulses seek immediate 
motor discharge. It seems reasonable 
to assume that the expressive responses 
associated with anger arousal serve 
this function. It should be noted that 
a significant change has been intro- 
duced in the character of the instinctual 
impulse of aggression as Freud con- 
ceived it. To the extent that it is 
meaningful to speak of discharge and 
release of an instinctual aggressive im- 
pulse, the index of release is affective 
expression and not injury to some 
person or object. 

As part of the socialization process, 
one learns to inhibit affective expres- 
sion as one is taught to inhibit destruc- 
tive behavior. The child learns to
substitute language and fantasy for
some forms of overt motor expression
(Rapaport, 1952) and the development
of thought processes enables him at
the same time to delay impulse expres-
sion. Acknowledgment, rehearsal, and
the verbal communication of feelings
(aided, as a rule, by facial gestures)
take the place of the motoric expres-
sion of emotional tension. The sensori-
tonic theory as formulated by Werner
and Wapner (1952) assumes a vicari-
ous relationship between motor inhibi-
tion and cognitive expression of ten-
sion. The results of several studies
concerned with the relationship be-
tween motor inhibition and Rorschach
movement responses are consistent
with this assumption (Meltzoff, Singer,
& Korchin, 1953; Singer, Meltzoff, &
Goldman, 1952). The writer has re-
ported an experiment in which sub-
jects, encouraged to deny their feelings
and then exposed to a fear-arousing
situation, projected reliably more anx-
iety onto a stimulus person in a film
than a comparable group of subjects
who had been encouraged to express
their feelings (Feshbach & Singer,
1957). In a more recent study con-
cerned with the restraint of positive
feelings (Feshbach, 1963), it was ob-
served that subjects instructed to re-
frain from communicating a successful
achievement through verbal or facial
means rated themselves as reliably
more happy and attributed more hap-
piness to others than a similarly suc-
cessful group who were not exposed
to this restraint condition. The latter,
in fact, hardly differed from a non-
rewarded control group. While these
studies are by no means conclusive,
they are consistent with the assump-
tion of a vicarious relation between
affective inhibition and cognitive ac-
tivity such that cognitive expression of
affect may serve as a substitute for
motor expression of affect.


In their discussion of the development
of aggression, Sears, Maccoby, and Levin
(1957) observe that children seem to experi-
ence relief by acting out aggressive impulses
even though other needs may not be grati-
fied. They suggest that the satisfaction and
relief comes from the reduction of conflict
deriving from the inhibition of aggression.
This hypothesis is similar to the affect-
discharge formulation and from an opera-
tional standpoint would be very difficult to
distinguish from it.

The problem of substitute value will 
be examined in the context of the dis- 
cussion of catharsis and the reduction 
of aggressive drive. Before so doing, 
we have yet to consider the most strik- 
ing aspect of aggressive drive, namely 
an organized pattern of behavior for 
which the goal response is the infliction 
of injury. Having rejected the notion 
that this destructive impulse is innate 
either in the Freudian sense of an 
instinctual drive or as an unlearned 
reaction to frustration, the problem 
arises of accounting for its origins. We 
shall be particularly interested in the 
evocation of hostile impulses as a reac- 
tion to frustration and threat, recog- 
nizing that there are other sources of 
aggressive drives, e.g., erotic impulse 
gratification which psychoanalytic the- 
ory suggests is a major element in 
sadism. 

Sears (1958) has proposed that the 
motive to injure others becomes ac- 
quired through a process of secondary 
reinforcement. Successful instrumen- 
tal aggressive acts result in the elimi- 
nation of frustration and also in the 
evocation of pain responses from the 
frustrator. Because of the consistent 
association of the perception of pain 
in others with removal of frustration, 
the former acquires secondary reward 
value. While this analysis has been 
extended by Sears and his collabora- 
tors to account for aggressive behavior 
mediated by an acquired aggressive 
drive which is independent of frustra- 
tion, we shall limit our discussion of 
this developmental account to the sim- 
pler situation in which an injured or- 
ganism finds satisfaction in inflicting 
pain upon the frustrating source. 

There is at the present time no good
evidence to support or refute the sec-
ondary reinforcement explanation of 
the origin of the impulse to injure 
others. One can, however, point to 
several difficulties presented by this 
particular theoretical account which as- 
sumes that the infliction of pain results 
in the attainment of the frustrated goal, 
not invariably, to be sure, but at least 
with sufficient frequency for the rein- 
forcement to generalize to the cues 
immediately preceding it. One may 
question the extent to which this as- 
sumption holds for most young chil- 
dren. Rather, hurting one’s parents 
or one’s older siblings is likely to be 
met with punishment and also with 
continued frustration of the blocked 
goal. An important exception to the 
latter would be negative attention-get- 
ting behavior which may be reinforced 
even though the attention may take the 
form of punishment of the child. At- 
tention, which is itself a derived moti- 
vation, would then be made to carry 
the reinforcement burden. A more 
removed but relevant difficulty is the 
observation made by students of ani- 
mal aggression that aggressive behav- 
ior in species other than Homo sapiens
is almost always instrumental to the
achievement of nonaggressive goals
(Scott, 1958). Animals do not ordi-
narily attack or kill other animals in
order to elicit the cues of pain despite 
the frequency of successful instrumen- 
tal aggression associated with some 
species and with some dominant or- 
ganisms within a species. The deriva- 
tion of satisfaction from eliciting pain 
and suffering in others is a peculiarly 
human phenomenon, suggesting that 
we look to factors that are uniquely 
human as the source of this mode of 
satisfaction. 

One such factor is the child’s ability 
to label and comprehend the contingent 
events in his environment. We have 
previously noted that the noxious con-
sequences of a child’s aggressive acts
are usually labeled as such. When he 
is punished for the expression of ag- 
gression, as well as other misdemean- 
ors, he learns to associate the infliction 
and perception of pain in others (pro- 
duced by his own aggression) with the 
experience of pain in himself. More- 
over, these are crudely correlated such 
that the greater the amount of pain 
inflicted by the child, the greater the 
degree of punishment received. Young 
children, as it were, are subjected to 
the “lex talionis.” More than fi s 
acquired when the child is punished! 
He also observes and imitates the be- 
havior of the punishing agent (Ban- 
dura, Ross, & Ross, 1961, 1963). 
Through a process of modeling, he 
learns a norm or law which essentially 
states that the infliction of pain is the 
response made to the experience of 
pain. He learns to apply this law to 
certain classes of “infliction of pain” 
which are categorized as “intentional” 
and “unfair.” He learns that it is 
appropriate, even required, to retaliate 
—that is, to inflict injury, under cer- 
tain social circumstances. The cues of 
pain elicited in the source of frustration 
signify that the requirements of the 
situation have been met. It is not 
enough to hit; one must also hurt. 
This is not to imply that the learned
retaliatory behavior pattern is completely
unrelated to the “natural” response
to frustration and provocation.
The anger evoked in such situations
serves to exaggerate the response
strength of these primitive retaliatory
tendencies. In addition, the expressive
responses associated with anger
can be readily utilized to elicit pain
cues in others. Finally, the affective
anger response may become part of
the stimulus complex which initiates 
the motivation to inflict injury. How- 
ever, despite the likelihood that anger 
and aggressive drive (hostility) will
be linked together, this need not be
the case. The motivation to injure
others, or aggressive drive, which
can be influenced by anger, is, according
to the present theoretical analysis,
an acquired motivation based upon the 
internalization of a standard which 
the child acquires from exposure to 
particular cultural values and from 
his own concrete experiences. Thus
this analysis directly links the moti- 
vation to injure others to the degree 
of punishment which the child has 
received, especially for the expression 
of aggression. There is a substantial
number of findings consistent with
this hypothesis (Bandura & Walters,
1959; Sears, Maccoby, & Levin, 1957;
Sears, Whiting, Nowlis, & Sears, 
1953). There are, however, several 
alternative explanations of the corre- 
lation obtained between punishment 
and aggressive behaviors, and more 
precise measures, particularly of the 
reinforcing properties of the infliction 
of pain, are necessary in order to provide
evidence more specific to this punish- 
ment model hypothesis. An adequate 
test of this hypothesis would require 
separate indexes of these reinforcing 
effects and the strength of instru- 
mental aggressive responses. We 
would expect that individuals who 
have had a history of experiencing 
retaliation and counteraggression when 
they aggress, in comparison to indi- 
viduals who have a history of reward 
for aggressive responses, would be 
more likely to be reinforced by the 
suffering of a provocateur and would
be more likely to persist in a sequence
of responses which can be anticipated
to culminate in the infliction of injury
to the provocateur.

The hypothesis that children acquire
aggressive drive through example
and precept may account for retaliatory
tendencies, but only partially explains
the “satisfaction” derived from
the perception of pain in one’s tormentors.
The rewarding properties
of this perception may also arise from 
its status and power implications. We 
have attempted to draw a sharp dis- 
tinction between aggressive acts which 
are instrumental to the attainment of 
nonaggressive goals and aggressive 
acts which are motivated by the intent 
to inflict injury upon some person or 
object. This distinction becomes 
blurred when hostility is elicited by 
a threat to self-esteem. And in human
beings, many, if not most, occasions
for hostility fall into this category;
some authors, in fact, consider
threats to self-esteem as the primary
source of aggression (Worchel, 1960).
Thus, the failure to receive an ex- 
pected invitation to some gathering 
implies more than interference with a 
potential pleasurable experience. A 
student may be annoyed and disap- 
pointed at receiving a failing grade. 
But if he should discover that the 
instructor is pleased by his failure, 
he is likely to become furious. A 
week’s solitary confinement in a nar- 
row cell with room barely wide enough 
to sit is a tremendously frustrating 
and painful experience. Prisoners be- 
come angry and bitter when subjected 
to this experience as a punishment. 
But the same experience when endured 
in connection with voluntary partici- 
pation in a research project designed 
to assess the psychophysiological ef- 
fects of confinement in a space cap- 
sule will elicit far less hostility. The 
difference in these situations is in the 
implied devaluation and impotence of 
the frustrated individual. Successful 
retaliation changes his status relation 
vis-a-vis the perceived source of de- 
valuation. He has demonstrated that 
he is not helpless and impotent. More- 
over, he need not be responsible for 
the hostile act. The initial instigator 
by virtue of being subjected to painful
experiences matching those he inflicted
on the victim is reduced to the vic- 
tim’s status. The nature of the de- 
valuation and the degree of aggression 
anxiety would determine the relative 
preference for self-initiation of an ag- 
gressive act versus infliction of injury 
upon a disliked person by some other 
individual. That the latter can be ef- 
fective is indicated by the Rosenbaum 
and DeCharms study (1960) in which 
the level of aggression of Subject A, 
who had been attacked by B, was 
reduced by overhearing Subject C at- 
tack B. The mediating process, how- 
ever, is unclear in that the reduction 
in aggression in this study may have 
been due to the support of A implied 
by C’s attack with consequent reduc- 
tion of the threat to self-esteem rather 
than due to the pain experienced by 
B. 

The postulated structural relation- 
ship between threats to self-esteem 
and satisfaction from the infliction of 
injury in conjunction with the acquisi- 
tion of the pain-infliction punishment- 
retaliation norm are proposed as the 
primary source of the reinforcing 
properties of the perception of pain in 
others. We have argued that these 
variables play a more significant role 
in determining preferences for be- 
havioral alternatives which elicit pain 
responses in others than the associa- 
tion of such stimuli with primary re- 
inforcement. It has also been main- 
tained that the relation between anger 
and the motivation to inflict injury 
upon others is neither intrinsic nor 
arbitrary and that the expression of
anger is not equivalent to but facilitates
aggressive drive-oriented behavior.
In addition to the question
of the antecedents of aggressive drive,
this analysis also bears directly upon
the problem of the reduction of the
“instigation to aggression.”

CATHARSIS AND THE REDUCTION OF
THE INSTIGATION OF
AGGRESSION


In order to delineate distinctions
between what is reduced or modified,
we shall adopt the following terminological
usage which is similar in a
number of respects to that suggested
by Buss (1961): (a) The terms, hos- 
tility, hostile impulse, and aggressive 
drive shall be used as equivalents; 
hostile behavior denotes responses 
which are primarily directed toward 
the infliction of injury upon some 
other person or object. (b) Anger 
refers to mediating affective responses 
whose expressive aspects have already 
been described. The intensity of anger 
is correlated with the vigor of
the hostile response and with the de- 
gree of injury which the aggressor is 
motivated to inflict upon the object. 
However, expressions of anger, though 
they may result in injury to some 
person or object, are not necessarily 
hostile in intent. Also, hostility may 
be strong although the affective re- 
sponse is short-circuited or blocked. 
(c) Aggression refers to the commis- 
sion of acts which produce injury 
in others where the injury is instru- 
mental to the achievement of nonag- 
gressive goals or is itself the principal 
goal.

The anger-hostile impulse constellation
is elicited by a variety of events
broadly labeled frustration, which include
interference with ongoing instrumental
activity, verbal and physical
attack, and threats to self-esteem.
The reduction of anger and aggres- 
sive drive can be accomplished 
through (a) injury to the frustrating 
source or some form of displaced ag- 
gression; (b) facilitation of mediat- 
ing responses which are incompatible 
with anger and hostility; (c) modification
of the initial eliciting stimulus
condition either through removal of
the stimulus or through a change in
the meaning of the stimulus. Categories
b and c represent nonaggressive
methods of “reducing” aggressive
drive.

[Figure 1, a three-panel diagram, is not reproduced here. Panel A: a
stimulus (frustration, ego threat) elicits anger, leading to motor
agitation, hitting, and vocalizations, and, through the anticipatory
evocation of pain cues, to physical, verbal, and passive aggression.
Panel B: the same stimulus elicits mediating responses (“anger breeds
destruction”; “To err is human, to forgive, divine”; “Love thine enemy”;
positive affect), leading to turning the other cheek, withdrawal, and
pacifying attempts. Panel C: the stimulus elicits anger, followed by
mediating responses (“I know I can succeed”; anticipatory elimination of
disease, poverty and injustice), leading to motor agitation, increased
striving, and participation in social movements, in scientific research,
etc.]

Fig. 1. Schematic representation of alternative mediating responses
to “aggressive” provocation.

An example of Category b is the 
advocacy and acquisition of positive 
affective responses to the stimulus con- 
ditions which ordinarily elicit hostility 
as in the exercise of Christian love 
and charity toward one’s enemy. It 
may also be possible and, in the opinion 
of some, desirable to accomplish a 
similar end while maintaining the
anger response, by strengthening non- 
hostile responses to anger (Kaufmann 
& Feshbach, 1963a, 1963b). Stratton 
(1923), some 40 years ago, stated 
and eloquently argued for this alterna- 
tive. He notes the undirected char- 
acter of anger— 


anger is thus seen to be a highly unspecial- 
ized compulsion; it has no immediate inter- 
est of its own, but arises to protect and to 
further any interest that you feel [p. 254]. 




He then suggests that anger can be
utilized in the service of socially constructive
pursuits.⁷ This is what William
James (1917) presumably had
in mind in his advocacy of a “moral 
equivalent of war.” 

These alternative possibilities are 
diagrammed in Figure 1. The com- 
plete short-circuiting of anger and the 
attachment of nonaggressive mediating 
responses directly to the instigating 
stimulus is depicted in Figure 1B 
while examples of nonaggressive re- 
sponses to anger are suggested 
in Figure 1C. The statements in the 
parentheses adjacent to the mediating 
stimuli are intended only as short- 
hand descriptions of the functions of 
the mediating process and are not 
meant to imply that these processes 
are conscious or solely cognitive. 


7 Stratton (1923) graphically illustrates 
the dynamogenic properties of anger in the 
following quote from a nineteenth century 
California preacher: “Although I weigh one 
hundred and twenty pounds, when I am mad, 
I weigh a ton.”

While the nonaggressive mediating responses
should theoretically serve to
mitigate hostile reactions, the precise 
empirical consequences of these medi- 
ating responses—particularly with re- 
spect to the possibility that they may 
function as intellectualized defenses 
or may exacerbate conflict over ag- 
gressive impulses—have yet to be as- 
certained. 

The modification of the initial 
stimuli eliciting hostility is probably 
the most effective means of reducing 
aggressive drive. A gratuitous insult 
and snub may arouse anger and the 
desire to hurt the insulter in some 
way. Let us assume that one dis- 
covers that no insult or snub was 
intended, that one had misinterpreted 
the presumed “insulting” comments. 
Under these circumstances the anger 
and hostility should disappear. Those 
operating within the framework of a 
psychoanalytic drive model might 
argue that the hostility has been in- 
hibited or repressed and that it must 
be expressed in order for the drive 
tension to be reduced. The stimulus- 
mediating response model which has 
been presented here takes a simpler 
view of the matter. The stimulus 
situation has fundamentally changed 
and this new meaning evokes a new 
response. There is no need to satisfy 
aggressive drive because the stimulus 
conditions which provoked hostility 
are no longer present. Changes in the 
stimulus are frequently accomplished 
through changes in mediating re- 
sponses, such that new perceptions of 
the same situation result from the 
attachment of different mediating 
labels, e.g., the classification of a previously
provocative situation as “unintentional”
or the act of an “irresponsible”
agent. As part of the
socialization process, such labels be- 
come discriminative cues to which non- 
hostile responses have been fostered
by the culture. This process by which
mediating labels modify the interpre- 
tation of a situation will in many in- 
stances be indistinguishable from the 
previously discussed process of miti- 
gating hostility through the attach- 
ment of nonaggressive mediating 
labels to the same situation. The 
advocacy of positive, nonaggressive 
reactions to provocation is often ac- 
companied by rationales which sug- 
gest that the provocateur is “sick” or 
is a “product of his environment,” 
thereby modifying the perception of 
the intentionality of the act and re- 
ducing its ego-threatening implica- 
tions. 

In addition to modifying the ex- 
ternal stimulus conditions, the impact 
of the noxious stimulus can also be 
altered by modifying the vulnerability 
of the insulted individual. If the 
threat to self-esteem is subsequently 
reduced through a procedure in which 
the initial instigator is not involved, 
the insulted person should still become 
less hostile to the instigator. Evidence 
consistent with this expectation has 
been reported by Worchel (1961) and 
Magaziner and Feshbach (1961). 
Changes in self-esteem often accom- 
pany the expression of hostility and 
complicate the evaluation of the ag- 
gressive drive-reducing effects of the 
infliction of injury (Rosenbaum & DeCharms,
1960; Thibaut & Coules,
1952). We distinguish here between
any changes in self-esteem which may 
be an intrinsic consequence of the in- 
fliction of injury and modifications in 
esteem which are independent of any 
pain experienced by the object of ag- 
gression. 

These complicating effects of self- 
esteem are most apparent in discus- 
sion and studies of catharsis, which 
are further complicated by the variety 
of meanings which have been at- 
tributed to this concept. When the
term was initially used by Freud and
Breuer (1948) in their papers on the 
treatment of hysteria, it referred to 
the tension-reducing consequences of 
emotional expression: in the case of 
the affect-drive system of anger-hos- 
tility, giving vent to one's hostile 
feelings was thought to reduce the in- 
tensity of the hostility even if no dam- 
age had been inflicted upon the object 
of the hostility. The term catharsis
is sometimes restricted to verbal ex- 
pressions of affect while abreaction is 
used to refer to motoric affective ex- 
pressions. In the quotation from the 
frustration-aggression volume at the 
beginning of this paper, catharsis de- 
noted a reduction in aggressive drive 
consequent to the infliction of injury. 
These two usages of catharsis⁸ correspond
to the distinction which has
been made between expressive aggres- 
sion and hostile aggressive drive. The 
acts of aggression that have been 
varied in experimental studies of 
catharsis usually have both an ex- 
pressive aspect and a noxious conse- 
quence although some primarily fall 
into the former category (Feshbach, 
1955; Siegel, 1956; Singer & Rowe,
1962), while others fall predominately
in the latter (Berkowitz, Green, &
Macaulay, 1962; Hokanson & Burgess,
1962; Hokanson & Shetler, 1961). 
Apart from the fact that studies of 
catharsis have not been designed to 
separate the effects of the expressive 
and noxious components of an aggres- 
sive act, data bearing upon the drive- 
reducing effects of some direct or
indirect aggressive behavior may well
yield conflicting results because of the
complex consequences of aggressive
behaviors (Berkowitz, 1958, 1960).
Thus, one implication of the present
stimulus-mediating response analysis
of anger and hostility is that participation
in aggression, whether directly
or vicariously, will have no anger or
hostility-reducing effect unless anger
or hostility has been aroused at the
time the aggressive act is performed
(Feshbach, 1961). Witnessing aggressive
sequences in films or participating
in a boxing match cannot be
expected to provide a vicarious expressive
or hostile function without
the organism being aggressively motivated.
If anger-hostility is not aroused,
then an increase in subsequent aggression
may take place as a result of
reinforcement of the aggressive response,
reduction in aggression-anxiety,
or aggressive response-eliciting properties
of aggressive stimuli (Bandura,
Ross, & Ross, 1961, 1962; Feshbach,
1956; Kenny, 1953; Lovaas,
1961).

8 We omit here any reference to the original
Greek meaning of catharsis which connotes

the state of feeling produced by the
dramatic tragedy. It meant the stillness
at the center of one’s being which came
after pity or fear have been burned out.
The soul is purified and calmed, freed
from ... violent passions [Scheer, 1961,
p. 520].

In addition, although aggressive 
drive may be reduced by an aggres- 
sive act, this effect may be obscured 
by reinforcement of the aggressive re- 
sponse, by lowered inhibition, and by 
more subtle influences such as dislike 
induced possibly by cognitive dis- 
sonance (Berkowitz, Green, & Macaulay,
1962), so that some measures
of aggression may reflect an increment 
in aggressive behavior. If the meas- 
ures of aggression one utilizes are about 
equally responsive to changes in drive 
strength, then the more similar the re- 
sponse assessed in the dependent meas- 
ure is to the rewarding, drive-reducing 
aggressive act, the more likely one is 
to find evidence of an increment of 
aggression. Dissimilar measures, in 
comparison, will be more likely to reflect
decrements in aggression. In evaluating
the consequences of an aggressive
act, it becomes essential to utilize a
variety of measures which are differ- 
entially responsive to the diverse effects 
which have been reviewed above. For 
example, in a recent unpublished study 
carried out by the author, angered 
subjects who were given the oppor- 
tunity to express their feelings in a 
sympathetic interview situation, in 
comparison to a control group, showed 
a significant increment in aggressive 
word associations, but at the same 
time, a significant decrement in dis- 
placement of aggression toward an in- 
nocent target. Thus in evaluating the 
effects of an aggressive activity upon 
subsequent aggression, opposite infer- 
ences can be drawn depending upon 
the measures of aggression used. The 
problem is not merely one of quanti- 
tative differences in the reliability or 
sensitivity of the measures employed, 
but rather of qualitative differences 
in the variables which are being as- 
sessed. One purpose of this paper 
has been to clarify the differential 
properties of these variables by means 
of the formal distinctions which have 
been made regarding the nature of ag- 
gressive drive and the relationship be- 
tween aggressive drive and instru- 
mental aggression. 

It is recognized that these theoreti- 
cal distinctions place rather severe re- 
quirements upon the measuring instru- 
ments that should be used in studies 
of aggression. The difficulty in sepa- 
rating the drive and habit components 
of a response is nowhere more acute 
than in the case of aggressive be- 
havior. Expressive, hostile, and in- 
strumental functions are interwoven
in most aggressive acts and these functions
are likely to be further obscured
by inhibitory factors. Nevertheless, it 
is possible through appropriate opera- 
tions to disentangle these properties.
Such an approach is necessary, we believe,
to provide a meaningful response
to the issue which is central to this 
discussion—namely, the functional 
value of an aggressive act—and to 
intimately related applied social issues 
which include questions concerning the 
effects of violence in television and 
other forms of mass media, the role 
of social prejudice and intra- and 
intergroup conflicts in personal adjust- 
ment, the pathological consequences of 
restraint versus expression of hostile 
impulses and, more generally, the ne- 
cessity for a culture to provide socially 
approved outlets for aggressive ex- 
pression. 


SUMMARY 


In order to provide a model to ac- 
count for the cathartic effects of an 
aggressive act and the conditions 
which result in a reduction in aggres- 
sive drive, the terms act of aggression 
and aggressive drive are analyzed. A 
definition of accidental and intentional 
aggression is offered. The latter cat- 
egory includes aggressive drive-medi- 
ated behavior and also instrumental 
aggression which denotes aggressive 
acts which are reinforced by the attain- 
ment of nonaggressive goals. It is 
argued that even in instrumental ag- 
gression the noxious consequences of 
the aggressive act are significant deter- 
minants of the aggressor’s behavior be- 
cause of the mediating function of 
the labeling of these consequences and 
because of the inhibitory effects of 
empathic responses. In defining ag- 
gressive drive, a distinction is made 
between expressive aggression and hos- 
tile aggression which is illustrated in 
the difference between the desire to hit 
and the desire to hurt. The essential 
element in expressive aggression is the 
topography of the response while in 
hostile aggression, the critical factor is 
the noxious consequences of the act.
Hostile aggression is assumed to be a 
learned drive whose primary but not 
sole antecedents are past exposure to 
punishment and present threats to self-esteem.

A stimulus-mediating response 
model of aggressive drive suggests a 
number of procedures for reducing 
aggression other than performance of 
an aggressive response. These include 
modification of the initial eliciting 
stimulus conditions and facilitation of 
responses which are incompatible with 
anger and hostility. The consequences 
of aggressive actions are more complex 
inasmuch as they may entail changes 
in the strength of the expressive im- 
pulse, in hostile aggressive drive, in 
aggression anxiety and in the habit 
strength of the aggressive response. 
Since different measures of aggression 
may be differentially responsive to 
these various processes and may yield 
diametrically opposite effects, the use 
of diverse measures of aggression is 
required to evaluate the cathartic ef- 
fects of a vicarious or direct aggres- 
sive response upon aggressive drive.


PERCEPTION, DRIVE, AND BEHAVIOR THEORY


The role of perceptual variables in Behavior Theory was examined in
terms of their implications for stimulus-intensity generalization and 
discrimination learning. Examination of the so-called sensory-interac- 
tion effects seriously questioned the validity of the afferent stimulus- 
interaction postulate. The postulate predicts a greater
fall for reaction potential to a test stimulus which differs from CS by x
units on one dimension and by y units on another than to a test stimulus 
which differs from CS by x units on the first dimension and is identical 
with respect to the other dimension. Considerations from the sensory- 
interaction literature suggest that under some conditions the opposite is 
true. Results of experiments on stimulus-intensity generalization and 
discrimination learning were found to conflict with the predictions de- 
rived from Behavior Theory, supporting the view based on sensory-inter- 
action effects. Treating drive (D) in terms of stimulus component (S_D)
and suggesting that its effects might under some circumstances be a 
special case of a sensory-interaction effect, the equivalence of the con- 
sequences of stimulus intensity and drive intensity were examined. Ex- 
perimental results supported the view proposed. An argument was 
advanced for an increased role of perceptual variables in Behavior 
Theory. 


In his last book Hull (1952) wrote:


Perhaps one of the reasons why the failure to
distinguish sharply between stimulation and
perception does not interfere any more markedly
with the validity of the present deductions
is that the elements of both stimulation
and generalization are explicitly included in
the system [p. 354].


The distinction between perception
and stimulation has, in fact, been
regarded as unnecessary in behavior
theory for many years. There is,
however, a growing body of literature,
never directly associated with behavior
theory, which might question
this position. This body of literature,
generally referred to under the rubric
of sensory interaction, contains a fair
amount of evidence that perceived
magnitudes of stimuli impinging on a
given sense modality are often markedly
influenced by the simultaneous
(or near-simultaneous) stimulation of
another sense modality. The causes of
sensory interaction effects are far from
well understood, and the effects are by
no means universal or unequivocal.
But one can hardly question the fact
that sensory interaction effects do
occur. Viewed in the light of the
Hullian position the results of sensory
interaction research have potentially
significant implications for behavior
theory. The present paper attempts
to examine some of these implications
and to demonstrate that the failure to
distinguish between stimulation and
perception does in fact interfere with
the deductions of behavior theory.


1 Studies reported here were part of a research
program supported jointly by Grants
AP. 49 (638)367 and G-4951 from the Air
Force Office of Scientific Research and the
National Science Foundation, issued to the
Research Center for Group Dynamics.

2 The reader can easily observe a sensory
interaction effect some quiet evening by
lifting his telephone receiver, listening to the
dial tone, fixating on a light bulb, and having
someone switch it on and off in rapid
succession. The amplitude of the dial tone
will appear to oscillate in phase with the
light. For best results no other light source
should be present.


Sensory Interaction and Stimulus Generalization

The sensory interaction literature 
has a direct relevance for Postulate 
XI: Afferent stimulus interaction 
(Hull, 1952, p. 11). Reviews and 
summaries of this literature (Carter, 
1962; Duffy, 1962; Gilbert, 1941;
London, 1954; Ryan, 1940) seem to 
indicate that in the majority of 
instances the simultaneous or near- 
simultaneous extraneous stimulation 
results in the lowering of sensory 
thresholds. Also, when subjects are 
required to estimate stimulus magni- 
tudes presented at supraliminal levels, 
simultaneous extraneous stimulation 
results in the overestimation of these 
magnitudes. In a small number of
cases one also finds absolute threshold 
increase or magnitude underestima- 
tion. About a third of the experi- 
ments show no effects. 

While these threshold and estimation
effects are probably manifestations
of the same perceptual consequences
of the extraneous stimulation,
in that the obtained threshold changes
reflect changes in the perceived or
“effective” stimulation, rather than
improvement or deterioration of the
sensory capacity, it is the stimulus
magnitude over- and underestimation
that has a more direct bearing upon
the afferent stimulus interaction postulate.
The postulate reads as follows:


All afferent impulses (s's) active at any given
instant, mutually interact converting each
other into š's which differ qualitatively from
the original s's so that a reaction potential
(sER) set up on the basis of some afferent
impulse (s) will show a generalization fall to
sĒR when the reaction (R) is evoked by the
other afferent impulse (š) [Hull, 1952, p. 11].⁴


3 The effects of factors such as extraneous
stimulation or drive on absolute threshold
and magnitude judgments are the subject of a
separate problem. It might be pointed out in
passing, however, that when we observe a
lowering of absolute threshold which obtains
as a consequence of some manipulation, such
as increased drive level, for example, the results
cannot always be interpreted as an “improvement”
in perception. The same manipulation
introduced when the subject is making
magnitude judgments of these stimuli,
supraliminally presented, may result in the
overestimation of these magnitudes. Overestimation
can hardly be interpreted as an
“improvement,” since it represents a decrement
in accuracy.


While the postulate assumes a 
qualitative change in the molar im- 
pulse which arises when the original 
stimulus is presented together with 
previously neutral stimuli, it cer- 
tainly must, as a special case, subsume 
quantitative changes. For in the last 
analysis a qualitative change is noth- 
ing but a cluster of quantitative 
changes too complex to be conveniently
described in quantitative
terms. Allowing the postulate to deal
with quantitative changes in the
molar impulse will enable us on the
basis of sensory interaction literature
to derive some new implications.

-onsi a given sensory interac- 
tion effect on stimuli lying on a par- 
ticular intensity continuum, A. Stim- 
ulus intensities on this continuum are 
known to be overestimated when 
judged in the presence of some extra- 
neous stimulation, B. Low magni- 
tudes of the extraneous stimulation, 


As paraphrased by Hilgard (1956) the 
afferent stimulus interaction postulate reads 
more clearly: 


When a response has been conditioned to a 
stimulus (Si) and one or more previously 
neutral stimuli (Sz, Sz, ...) are presented 
along with Sı, the afferent impulses from 
this combination of stimuli interact to yield 
a new molar impulse (s). This new im- 
pulse is equivalent to a stimulus at a 
greater or less distance from Si on a qualita- 
tive continuum. The resulting generali 
reaction potential to s will be smaller than 
to Si, depending upon their remoteness 
from each other. 


Perception, Drive, and Bewavior Tory 


Bi, result in minimal overestimation, 
and high magnitudes, B., result in a 
larger overestimation of the A stimuli, 
Suppose further that a response is 
conditioned to a particular A stimulus 
when the latter is presented together 
with Bı. Following training, this A 
stimulus is again presented but now 
together with Bz, and performance is 
observed, Since the perceived or 
“effective” intensity of the A stimuli 
is known to increase the presence of 
Bu, performance to the particular CS 
will show a generalization fall when 
the extraneous stimulus intensity is 
increased. Thus far the prediction is 
entirely in agreement with the afferent 
stimulus interaction postulate. How- 
ever, since we are dealing with pre- 
sumably known quantitative changes 
in the “effective” stimulation, we 
are also in a position of making 
specific predictions about generalized 
responses to A stimuli other than the 
particular CS; for instance, CS - a, 
CS - 2a, CS - 3a, ..., CS + a, 
CS + 2a, CS + 3a, ..., where a is 
a magnitude on the A dimension, ex- 
pressed perhaps in log units to satisfy 
Postulate X (Hull, 1952, p. 11). 

The generalization fall in perform- 
ance to the CS had been obtained 
because its perceived intensity in- 
creased when it was presented to- 
gether with B2. But other A stimuli 
will also be perceived as more intense 
in the presence of B2 than in the 
presence of B1. Since CS - a, CS - 2a, 
..., would in the presence of B2 ap- 
pear more intense, they would become 
effectively more similar to the trace of 
the CS maintained from training, 
where it was presented in the com- 
pany of the weaker extraneous stimu- 
lation, B1. Let us, for the purposes of 
argument, assume that the psycho- 
physical change in the perception of 
the A stimuli occurring when B1 is 
replaced by B2 is equivalent to one a 
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unit, and that such changes are more 
or less uniform for the range of values 
employed in the generalization test, 
such that any given stimulus inten- 
sity on the A continuum, S_A, when 
presented with B1 is perceptually 
equivalent to S_A + a when B1 is 
replaced by B2. Thus, generalized 
responses to CS - a, CS - 2a, ..., 
will be more probable (or stronger) 
when obtained in the presence of B2 
than when obtained in the presence of 
B1. Responses to CS - a obtained 
in the presence of B2 might be as 
probable as responses to the CS itself 
when the latter is presented in the 
company of B1. Responses to CS - 2a 
obtained in the presence of B2 will 
be as probable as those obtained to 
CS - a in the presence of B1, etc. 

Since CS + a, CS + 2a, ..., would 
in the presence of B2 also increase in 
their perceived intensity by one a 
unit, they would become less similar 
to the trace of the original CS, and 
the generalized responses elicited by 
them would be weaker than when 
observed in the presence of B1. Con- 
sequently, for the stimulus continuum 
employed in the test, we obtain not a 
fall in the generalization gradient but 
a lateral displacement of the gradient 
toward the lesser stimulus intensities 
of the A continuum, with mode at 
CS — a. 
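
The displacement argument can be illustrated numerically. What follows is only a minimal sketch, not the procedure of the experiments: it assumes an exponential fall-off of response strength around the trained trace, a step of a = .06 log unit, and a perceptual shift of exactly one a unit under B2. The function name, the decay constant, and the location of the CS are hypothetical.

```python
import math

A_UNIT = 0.06        # assumed step size a on the A continuum (log units)
CS = -0.20           # assumed location of the trained stimulus
SHIFT_B2 = A_UNIT    # assumption: B2 raises perceived intensity by one a unit

def response_strength(physical, shift, trace=CS, decay=20.0):
    """Hypothetical gradient: strength falls off exponentially with the
    distance between the perceived test intensity and the trained trace."""
    perceived = physical + shift
    return math.exp(-decay * abs(perceived - trace))

# Test stimuli CS - 3a ... CS + 3a, indexed by k = -3 ... 3
steps = range(-3, 4)
gradient_b1 = {k: response_strength(CS + k * A_UNIT, 0.0) for k in steps}
gradient_b2 = {k: response_strength(CS + k * A_UNIT, SHIFT_B2) for k in steps}

mode_b1 = max(gradient_b1, key=gradient_b1.get)  # peak under training conditions
mode_b2 = max(gradient_b2, key=gradient_b2.get)  # peak when tested under B2
print(mode_b1, mode_b2)
```

Under these assumptions the whole gradient keeps its height and moves one step toward the weaker stimuli, which is the lateral displacement described above rather than a uniform fall.
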

These predictions conflict with ones 
derived from Postulate XI. Consider 
only two stimuli, the CS and CS — a, 
which for the purpose of simplicity 
we will now refer to as S1 and S2, 
retaining the relation S1 > S2. Since 
the original training occurs in the 
presence of B1, the trained response 
is attached to the compound S1B1. 
Above we predicted that the general- 
ized reactions to stimuli weaker than 
the CS will be more probable when 
obtained under increased extraneous 
stimulation than when obtained under 
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constant extraneous stimulation, or in 
Hullian terms that $_{S_2B_2}E_R > {}_{S_2B_1}E_R$. 
Essentially then we are predicting 
that in the case of a two-dimensional 
stimulus generalization, a test stimu- 
lus which differs from the CS on one 
dimension will produce a lower reac- 
tion potential than a test stimulus 
which differs from the CS to an equal 
extent on that dimension, but which 
also differs from the CS on one other 
dimension! 

But we are dealing here with 
stimulus-intensity continua and there- 
fore considerations deriving from stim- 
ulus-intensity dynamism (V) may 
complicate matters. A more critical 
examination of the Hullian prediction 
is therefore called for. If drive (D) 
and incentive (K) are held constant, 
we have the generalized reaction po- 
tential to S2, obtained when extrane- 
ous stimulation remains the same 
from training to test, 
$_{S_2B_1}E_R = {}_{S_1B_1}E_R \times 10^{-j d_S} \times V_{S_2B_1}$, 
where j is an empirical generalization 
constant, $d_S$ the difference between 
S1 and S2 in log units, and $V_{S_2B_1}$ the 
stimulus-intensity dynamism of the 
combined S2 and B1 stimulation. The 
generalized reaction potential to the 
same test stimulus but obtained under 
increased extraneous stimulation can 
be written as follows: 
$_{S_2B_2}E_R = {}_{S_2B_1}E_R \times 10^{-i d_B} \times V_{S_2B_2}$, 
where i is another empirical 
generalization constant, $d_B$ the differ- 
ence between B1 and B2 in log units, 
and $V_{S_2B_2}$ the stimulus-intensity dy- 
namism of the combined S2 and B2 
stimulation. To obtain the prediction 
based on sensory interaction effects 
from Hullian theory we must prove 
that $_{S_2B_2}E_R > {}_{S_2B_1}E_R$, or 
$_{S_2B_1}E_R \times 10^{-i d_B} \times V_{S_2B_2} > {}_{S_2B_1}E_R$. 
But since both $10^{-i d_B}$ and $V_{S_2B_2}$ 
are equal to or smaller than unity, 
the above relation is false according 
to Hull’s theory. 

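
The Hullian side of the argument can be checked with a few lines of arithmetic. The sketch below assumes the multiplicative form used in the text, in which a generalized reaction potential equals a base potential times 10 raised to minus (constant times distance) times a stimulus-intensity dynamism no greater than unity. All numerical values and the function name are hypothetical.

```python
def hull_generalized_potential(base, constant, distance, dynamism):
    """Generalized reaction potential: base x 10**(-constant*distance) x V."""
    return base * 10 ** (-constant * distance) * dynamism

E_S1B1 = 1.0                 # assumed trained reaction potential
j, d_S = 0.15, 0.06          # hypothetical constant and S1-S2 distance (log units)
i, d_B = 0.15, 1.0           # hypothetical constant and B1-B2 distance (log units)
V_S2B1, V_S2B2 = 0.8, 0.9    # hypothetical dynamism values, each <= 1

# Test stimulus S2 with the training level of extraneous stimulation
E_S2B1 = hull_generalized_potential(E_S1B1, j, d_S, V_S2B1)
# Same test stimulus with increased extraneous stimulation
E_S2B2 = hull_generalized_potential(E_S2B1, i, d_B, V_S2B2)

print(E_S2B2 <= E_S2B1)
```

Because both multipliers are at most unity, no choice of non-negative constants in this sketch lets the potential under B2 exceed the potential under B1.
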
When a response is trained to an 
A stimulus in the presence of the 
stronger extraneous stimulation, B2, 
and generalization tests are performed 
in the presence of the weaker extrane- 
ous stimulation, sensory interaction 
effects suggest that a mirror image of 
the former prediction will be obtained. 
Now the generalization gradient will 
show a lateral displacement to- 
ward the greater stimulus intensities 
with the peak of the gradient at 
CS + a. Under these conditions the 
responses to stimuli stronger than the 
CS will increase in probability when 
extraneous stimulation is decreased 
during generalization testing. Here 
the Hullian model is even less likely 
to predict an increment in generalized 
reaction potentials as a consequence 
of the decrement in extraneous stimu- 
lation. Now the difference between 
the CS and the testing stimulus as 
well as the reduction in the total 
stimulus-intensity dynamism both act 
to reduce $_SE_R$. 

In an experiment whose attempt 
was to examine the above predictions 
(Dorfman, 1961a), the training CS 
and the generalization stimuli were 
slight electric shocks (from —.38 to 
—.02 log milliampere) of 1-second 
duration. The extraneous stimula- 
tion was a 45-decibel tone for B1 and 
a 96-decibel tone for B2, both at 
1,000 cps, occurring also for a 1-sec- 
ond duration and simultaneously with 
the electric shocks. The distance be- 
tween the stimuli selected for the 
generalization test (—.06 log milli- 
ampere) was established on the basis 
of a previous psychological investiga- 
tion (Dorfman, 1961b) in which there 
were indications that a change from 
B1 to B2 resulted in an overestimation 
of shock intensities of approximately 
that order. 

One group of subjects were trained 
under low extraneous stimulation, and 
another group under high extraneous 
stimulation. In order to avoid the 


problem of asymmetrical generaliza- 
tion gradients a procedure developed 
by Brown, Clarke, and Stein (1958) 
was employed. One verbal response 
(WIN) was reinforced to the -.20 
log milliampere shock, and another 
(LOSE) to the shock of the lowest 
(-.38 log milliampere) and to the 
shock of the highest intensity (-.02 
log milliampere). The three train- 
ing stimuli were presented one at a 
time in 15-second intervals, and 
simultaneously with either B1 or B2, 
depending on the experimental condi- 
tion. In both groups subjects were 
given 12 training trials to the middle 
stimulus, and 7 training trials to 
each of the two extreme stimuli. 
The subjects were reinforced for 
correct responses by the experimenter’s 
saying “right” and for incorrect re- 
sponses by the experimenter’s saying 
“wrong.” In the generalization test 
each of the seven stimuli was pre- 
sented six times: three times with B1 
and three times with B2 in a random 
sequence. During the test no reinforce- 
ment was given; however, inter- 
spersed among the testing trials were 
six reconditioning trials. The data 
for this experiment are reproduced in 
Figure 1, where it is clear that a 
lateral displacement of the gradients 
occurred when the extraneous stimu- 
lation was changed from training to 
test. When subjects are trained in 
the presence of low extraneous stimu- 
lation the modes and the medians of 
their generalization gradients are at 
significantly lower shock intensities 
for tests taken in the presence of 
strong extraneous stimulation than 


This experiment was also performed using 
a conventional generalization procedure in- 
volving only one response, and essentially the 
same results were obtained (Karp, 1961). 
There were fewer than 10% of incorrect 
responses during the initial training, the dis- 
crimination performance reaching 100% ac- 
curacy toward the end of the training. 
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for tests taken in the presence of weak 
extraneous stimulation. Conversely, 
when training occurs under high ex- 
traneous stimulation, and when the 
test is given in the presence of the 
weaker tone, the modes and medians 
of the gradients are displaced toward 
higher shock intensities. When ex- 
traneous stimulation is increased gen- 
eralized reactions to stimuli weaker 
than the CS gain in probability of 
emission, and when it is decreased 
reactions to stimuli stronger than the 
CS gain in probability. These effects 
were uniform throughout the course 
of the generalization tests. For pur- 
poses of comparison, the Hullian 
predictions for this — + — training 
procedure were computed, assuming 
the usual constants for generalization 
and for V (Hull, 1952), and using a 
formula suggested by Hull (1951, 
p. 94) for computing the combined V, 
involving both the S and the B in- 
tensities. These predictions are shown 
alongside the empirical results, and it 
is clear that the results are in disagree- 
ment with the implications of the 
Hullian model. 
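
The stimulus set and the generalization-test schedule described above can be sketched as follows. This is an illustration only: it reproduces the seven shock values and the 42 test presentations from the description, but the six interspersed reconditioning trials are left out, and the seed and variable names are hypothetical.

```python
import random

# Seven shock intensities from -.38 to -.02 log milliampere in .06 steps
shocks = [round(-0.38 + 0.06 * k, 2) for k in range(7)]

# Each stimulus is tested six times: three times with B1, three with B2
test_trials = [(shock, tone)
               for shock in shocks
               for tone in ("B1", "B2")
               for _ in range(3)]

rng = random.Random(0)   # fixed seed only so the sketch is reproducible
rng.shuffle(test_trials) # random sequence of the 42 test presentations

print(shocks)
print(len(test_trials))
```

The middle value, -.20, is the stimulus to which WIN was reinforced; the two extremes are the stimuli to which LOSE was reinforced.
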


Drive and Stimulus Generalization 


There has always been some am- 
biguity in predicting the effects of 
drive shift upon generalization. The 
energizing component of drive (D) 
leads to one set of predictions and the 
stimulus component of drive (S_D) to 
another. Because of the multiplica- 
tive laws, $E = D \times H$ and $\underline{E} = D \times \underline{H}$, 
one would expect increments in gener- 
alized responses and in the generaliza- 
tion slope when drive is increased 
following training, and decrements 
when drive is decreased. No dis- 
placement of the modal values of the 
generalization gradients would be 
expected. In support of this predic- 
tion Spence (1958) reports an experi- 
ment by Newman (1955) who ob- 
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served generalization along the size 
dimension for rats whose hunger level 
was either increased or decreased 
following training. 

From the point of view of the drive 
stimulus, S_D, however, one would ex- 
pect a decrement in response strength 
regardless of whether drive is in- 
creased or decreased following train- 
ing. Since increments as well as 
decrements in drive represent changes 
in the molar impulse, the law of 
afferent stimulus interaction applies, 
Fig. 1. The effect of change in extraneous stimulation (loudness) on generalization along 
the shock dimension: Experimental results and theoretical predictions from Hull. 
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predicting that any change in a 
molar impulse produces a generaliza- 
tion fall. It seems that Hull must 
have given more weight to the role 
of S_D, for in Corollary XI he states: 


When a habit is set up in association with a 
given drive intensity and its strength is 
tested under a different drive intensity, there 
will result a falling gradient of $_S\underline{H}_R$ and $_S\underline{E}_R$ 
(Hull, 1951, p. 90). 


Apparently this corollary was estab- 
lished solely on the basis of empirical 




results such as those of Yamaguchi 
(1952), for one aspect of the corollary 
seems to contradict the formal model. 
The corollary is consistent with the 
model with respect to the prediction 
of a falling $_S\underline{H}_R$ gradient. However, 
it is difficult to see that the model can 
also predict a falling $_S\underline{E}_R$ gradient. 
Consider a reaction potential set up 
under a given drive level, D1, and a 
subsequent test under an increased 
drive, D2. The corollary predicts 
that $_{S_{D_1}}E_R > {}_{S_{D_2}}E_R$, where $S_{D_1}$ and 
$S_{D_2}$ are the two drive stimuli. Ex- 
panding we have: 
$_{S_{D_1}}H_R \times D_1 \times V_1 > {}_{S_{D_1}}H_R \times 10^{-b d_{S_D}} \times D_2 \times V_1$, 
where b is an empirical generalization con- 
stant, and $d_{S_D}$ the difference between 
the drive stimuli. The above rela- 
tion simplifies to $D_1 > 10^{-b d_{S_D}} \times D_2$. 
Fixing D1 at 1.0 and assuming that 
S_D is equal to log D, we obtain the 
result that the empirical generaliza- 
tion constant b must be smaller than 
—1.0 for the relation to hold. Since 
the empirical results on generalization 
consistently indicate that the con- 
stant is seldom smaller than —.15, 
the corollary cannot be justified. 
Moreover, the corollary conflicts with 
the data reported by Newman (1955). 
On the other hand, working with the 
stimulus sampling model and treating 
drive as a set of irrelevant stimulus 
elements, Estes (1958) arrived at a 
prediction for generalization gradients 
obtained after a drive shift. Con- 
trary to the above inferences, incre- 
ments in drive following training were 
predicted by him to result in a shal- 
lower gradient with response prob- 
ability somewhat depressed at the CS, 
while decrements in drive in a steeper 
gradient, also with a depressed re- 
sponse probability at the CS. 
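
The algebra behind the objection to Corollary XI can be checked numerically. The sketch below is written under stated assumptions: D1 is fixed at 1.0, the drive-stimulus distance is taken as the difference in log D, and k stands for the magnitude of the generalization constant, so that the decay factor is 10 raised to minus k times the distance. The function name and the trial values of D2 and k are hypothetical.

```python
import math

def corollary_holds(k, d2, d1=1.0):
    """True if D1 > 10**(-k * d) * D2, with d = log10(D2) - log10(D1).

    k is the magnitude of the generalization constant (an assumption of
    this sketch about the sign convention), d2 the increased drive level.
    """
    d = math.log10(d2) - math.log10(d1)
    return d1 > 10 ** (-k * d) * d2

# A decay magnitude of the size usually reported for generalization data
print(corollary_holds(0.15, 2.0))
# A decay magnitude greater than 1.0, as the corollary would require
print(corollary_holds(1.5, 2.0))
```

With D1 = 1 the right-hand side reduces to D2 raised to the power (1 - k), so for any increased drive the relation can hold only when the magnitude of the constant exceeds 1.0.
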

If, in the case of drive shift, the 
most significant aspect of drive is, 
as Estes argues, its stimulus element, 
S_D may be viewed simply as an ex- 
traneous stimulus such as we dealt 
with in the above experiment by 
Dorfman (1961a). If as such S_D 
leads to the overestimation of other 
stimulus magnitudes, then an incre- 
ment in drive following training 
should result in a lateral displacement 
of the generalization gradient in the 
direction of lower stimulus magni- 
tudes, and a decrement in drive in the 
direction of higher stimulus magni- 
tudes. We would thus expect essen- 
tially the same effects whether we 
manipulate some irrelevant extrane- 
ous stimulation or irrelevant drive. 
It should be noted that neither the 
inference based on S_D effects, nor one 
based on the multiplicative law pre- 
dicts a lateral displacement of the 
generalization gradient following a 
drive shift. 

In the present program of research 
some experiments dealt with this 
problem (Carter & Zajonc, 1962; 
Matlin, 1960; Platz, 1962; Platz & 
Zajonc, 1961; Zajonc & Cross, 1964). 
In one of the experiments (Platz, 
1962) subjects were first given differ- 
ential fear conditioning in a series of 
29 trials in which one of two lights 
was followed by a fairly strong elec- 
tric shock. The duration of the light 
was .5 second, and its separation from 
the shock offset varied randomly from 
zero to five seconds for each subject. 
The shock intensity was set according 
to the tolerance of each subject, found 
prior to the fear conditioning. Meas- 
ures of GSR were taken to assess the 
relative effectiveness of the differen- 
tial fear conditioning. Following this 
fear conditioning essentially the same 
procedure as that in Dorfman's ex- 
periment (1961a) was employed. A 
particular verbal response (HIT) was 
reinforced over a series of 15 trials 
to a 45-decibel tone and another 
verbal response (MISS) was reinforced 
to a 42.5-decibel and to a 50.0-decibel 
Fig. 2. The effect of change in drive (fear) on generalization 
along the loudness dimension. 


tone over a series of 12 trials each. 
In one group of subjects the middle 
and the extreme stimuli were pre- 
sented together with the previously 
shocked light (with the shock now 
absent), and in another group the 
three training stimuli were presented 
with the previously nonshocked light. 
As in Dorfman’s experiment, follow- 
ing the — + — training Platz ob- 
served the proportion of HIT responses 
to seven test stimuli ranging from 42.5 
to 50.0 decibels. All auditory stimuli 
were given at 1,000 cps. The test 
stimuli were given in a random order 
in 15-second intervals. The shock 
(or nonshock) light was of .5-second 
duration, and it preceded the tone 


stimulus by one second. On half of 
the trials the test stimuli were pre- 
sented together with the shocked light 
(but without the shock), and on the 
other half in the presence of the non- 
shocked light. Interspersed among the 
generalization trials were also some 
reconditioning trials to maintain the 
differential fear reaction to the two 
lights. On each generalization trial 
GSR measures were taken. 

Figure 2 shows the results of the 
drive shift upon the proportion of 
generalized HIT responses. A lateral 
displacement similar to that found 
when extraneous stimulation was ma- 
nipulated is observed. When drive is 
increased from training to test, the 
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curve is displaced toward the weaker 
tones. When drive is decreased, 
however, the gradients show a dis- 
placement in the direction of the 
louder tones. But Platz’s results, 
although at an acceptable level of 
significance, seem to be less pro- 
nounced than those of Dorfman. 
When drive was manipulated the peaks 
of the gradients did not shift as was 
the case when extraneous stimulation 
was manipulated. There is, however, 
a basic difference between the two 
manipulations. 

The average median of the gradient of the 
group trained under high drive was displaced 
in the direction of higher stimulus intensities 
by .68 decibel when the test stimuli were 
given with the nonshock light, while the 
median of the group trained in the nonshocked 
light was displaced in the direction of lower 
intensities by .77 decibel when the stimuli 
were given in the presence of the shocked 
light. This effect was significant at the .001 
level (F = 36.85, df = 1/38). 

Fig. 3. The effect of change in drive (fear) on generalization along the loudness 
dimension for “good” and “poor” conditioners. 

In Dorfman's ex- 
periment the difference in the levels 
of extraneous stimulation was con- 
stant for all subjects. In the experi- 
ment manipulating drive, the differ- 
ences in conditioned fear depended 
on the success of the previous differen- 
tial fear conditioning and its extinc- 
tion status during the training and 
testing of responses to the auditory 
stimuli. Individuals who conditioned 
well, in that their GSRs to the shocked 
and nonshocked lights reached and 
maintained a relatively large differ- 
ence, would necessarily be under the 
influence of a relatively greater drive 
shift than individuals whose psycho- 
galvanic reactions to the two lights 
were not very different. The sub- 
jects in the study were accordingly 
separated into “good” and “poor” 
conditioners and their results are 
shown in Figure 3. It is immediately 
clear that the generalization dis- 
placement occurring as a function of 
drive shift obtains primarily for 
“good” conditioners. 

Another related finding was ob- 
tained in Platz’ study which is of 
some interest. If a subject emits 
the response which was originally 
trained to the CS when a weaker tone 
is presented, he is essentially “mis- 
taking” it for the CS. His response 
is HIT while it should have been MISS. 
One could argue that such “mistakes” 
are facilitated when the drive-pro- 
duced stimulation present at the 
moment is relatively high. Here the 
high drive-produced stimulation in- 
creases the apparent or “effective” 
intensity of the weak tone and thus 
makes it more similar to the CS. On 
the other hand, HIT responses to 
stimuli louder than the CS will occur 
when the drive-produced stimulation 
is relatively low. Here the low drive- 
produced stimulation decreases the 
apparent intensity of the test stimu- 
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lus and thereby makes it similar to 
the CS. To the extent that the GSR 
measures the intensity of this drive- 
produced stimulation, we would ex- 
pect that the GSR obtained when a 
subject emits a HIT response should 
decrease with the intensity of the 
stimulus. The level of GSR observed 
when HIT responses were given by 
subjects to the seven auditory stimuli 
is shown in Figure 4. The relation 
expected from the sensory interaction 
point of view is clearly evident. The 
effect is of particular significance be- 
cause, other factors held constant, 
normally we obtain an increase in 
GSR with an increase in stimulus 
intensity. 

In other experiments with pigeons 
using hunger (Zajonc & Cross, 1964) 
or electric shock (Carter & Zajonc, 
1962) as the drive-producing stimulus, 
or with human subjects, using dy- 
namometer tension as the drive- 
producing stimulus (Matlin, 1960), 
significant lateral displacements of 
generalization gradients were also 
found as a consequence of drive shift. 




Sensory Interaction and Discrimination Behavior 


The experiments reported earlier 
used with human subjects a general- 
ization procedure often involving dis- 
crimination learning. Another series 
of experiments in the present research 
program examined some implications 
of sensory interaction and drive 
effects upon discrimination learning. 

Consider two stimuli, S1 and S2, 
slightly different in intensity, with 
S1 < S2. Suppose that in a succes- 
sive discrimination learning procedure 
S1 is associated with one response, R1, 
such as pressing one button, and S2 
with another response, R2, such as 
pressing another button. Suppose 
further that S1 and S2 are always 

Fig. 4. Mean GSR (in micromhos) accompanying “hit” 
responses as a function of stimulus intensity. 


presented together with some ex- 
traneous stimulation B1, low in in- 
tensity. Correct responses to the 
stimuli are rewarded and incorrect 
punished. The pairs S1B1 and S2B1 
occur in a random sequence and 
equally often over a series of trials. 
Interspersed among these trials, how- 
ever, are also trials on which the two 
stimuli, S1 and S2, one at a time, are 
presented simultaneously with ex- 
traneous stimulation, B2, considerably 
more intense than B1. These trials 
occur in a random sequence and as 
often as those accompanied by B1. 
Whenever S1B2 or S2B2 are presented 
neither positive nor negative rein- 
forcement is administered, although 
the subjects are still required to 
select one of the two responses. We 
are concerned with the prediction of 
discrimination performance with re- 
spect to the nonreinforced pairs, S1B2 
and S2B2. 

Since we are dealing with discrimi- 
nation learning, the critical element 
of the situation is the distance be- 
tween Sı and Sz. It was noted above 
that sensory interaction effects lead 
to changes in the perceived intensities 
of stimuli, and we must therefore 
consider not the physical but the 
“effective” or the perceived distance 
between stimuli. The assumption 
that extraneous stimulation increases 
the perceived intensities of S1 and S2 
leads us to predict that the “effective” 
distance from S2B1 will be greater 
for the reinforced S1B1 than for the 
nonreinforced S1B2. The change in 
the “effective” distance obtains be- 
cause the perceived intensity of S1 
increases when it is accompanied by 
B2. Consequently, performance to 
S1B2 will suffer. However, from the 
same assumption it follows that the 
“effective” distance from S1B1 will be 
smaller for the reinforced S2B1 than 
for the nonreinforced S2B2. The 
effect obtains because the perceived 
intensity of S2 increases when it is 
presented in the company of B2. 
Consequently, the performance to the 
pair S2B2 will gain. Assuming that 
the reaction potentials for the training 
pairs S1B1 and S2B1 are about equal, 
we obtain the prediction that the 
proportions of correct responses should 
be ordered as follows: S2B2 > S2B1 
= S1B1 > S1B2. This prediction holds 
that the highest proportion of correct 
responses should obtain for a stimulus 
compound, S2B2, to which responses 
were never reinforced! And it is 
again in conflict with one derived 
from the Hullian system. 
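
The ordering derived from sensory interaction can be reproduced with a small calculation. The sketch assumes, purely for illustration, that the stronger extraneous stimulation adds a fixed amount to the perceived intensity of either discriminandum, and it ranks each compound by its perceived distance from the reinforced compound of the other discriminandum. The intensity values, the size of the boost, and the function names are hypothetical.

```python
# Hypothetical intensities (arbitrary units), with S1 < S2
INTENSITY = {"S1": 51.0, "S2": 55.0}
# Assumed increase in perceived intensity produced by each extraneous level
BOOST = {"B1": 0.0, "B2": 2.0}

def perceived(s, b):
    """Perceived ("effective") intensity of discriminandum s under b."""
    return INTENSITY[s] + BOOST[b]

def effective_distance(s, b):
    """Distance between the compound (s, b) and the reinforced compound
    of the other discriminandum, which was always trained under B1."""
    other = "S2" if s == "S1" else "S1"
    return abs(perceived(s, b) - perceived(other, "B1"))

pairs = [(s, b) for s in ("S1", "S2") for b in ("B1", "B2")]
distances = {s + b: effective_distance(s, b) for s, b in pairs}

# Larger effective distance stands in for a higher proportion correct
ranking = sorted(distances, key=distances.get, reverse=True)
print(distances)
print(ranking)
```

The never-reinforced compound S2B2 comes out farthest from the competing training compound, the two training compounds tie, and S1B2 comes out nearest, which matches the ordering stated in the text.
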

It suffices to show that at least one 
of the inequalities of the above rela- 
tion cannot be derived from Hull's 
postulates. Let us examine the ability 
of the Hullian model in predicting 
$_{S_2B_1}\dot{E}_{R_2} < {}_{S_2B_2}\dot{E}_{R_2}$, where $_S\dot{E}_R$ is the 
net discriminatory reaction potential. 
In terms of the Hullian law of dis- 
crimination learning (Hull, 1952, pp. 
85-92) we have the net discrimina- 
tory reaction potential for S2B1: 


$$_{S_2B_1}\dot{E}_{R_2} = {}_{S_2B_1}E_{R_2} - {}_{S_2B_1}E_{R_1}.$$


Assuming that the product $D \times K$ is 
constant: 


$$_{S_2B_1}\dot{E}_{R_2} = {}_{S_2B_1}H_{R_2} \times V_{2B_1} - {}_{S_1B_1}H_{R_1} \times V_{1B_1} \times V_{2B_1} \times 10^{-b d_S},$$


where the first subscript for V refers 
to the discriminanda S1 and S2, and 
the second to the extraneous stimuli, 
B1 and B2 (V being the stimulus- 
intensity dynamism of the combined 
S and B stimulation), b is an empirical 
generalization constant, and $d_S$ the 
distance between S1 and S2. We 
also have: 


$$_{S_2B_2}\dot{E}_{R_2} = {}_{S_2B_2}E_{R_2} - {}_{S_2B_2}E_{R_1}$$

$$= {}_{S_2B_1}E_{R_2} \times 10^{-a d_B} \times V_{2B_2} - {}_{S_2B_1}E_{R_1} \times 10^{-a d_B} \times V_{2B_2}$$

$$= {}_{S_2B_1}H_{R_2} \times V_{2B_1} \times 10^{-a d_B} \times V_{2B_2} - {}_{S_1B_1}H_{R_1} \times V_{1B_1} \times 10^{-b d_S} \times V_{2B_1} \times 10^{-a d_B} \times V_{2B_2},$$


where a is another empirical general- 
ization constant, and $d_B$ the distance 
between B1 and B2. Let us assume 
that the training was sufficiently long 
such that $_{S_1B_1}H_{R_1}$ and $_{S_2B_1}H_{R_2}$ are 
equal to unity. Dividing both sides 
of the inequality under examination 
by $V_{2B_1} \times 10^{-a d_B} \times V_{2B_2}$ we have: 


$$\frac{1 - V_{1B_1} \times 10^{-b d_S}}{V_{2B_2} \times 10^{-a d_B}} < 1 - V_{1B_1} \times 10^{-b d_S}.$$


Since the denominator of the left 
side of the inequality must be smaller 
than unity, the inequality is false. 

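
The same conclusion can be checked by direct computation. The sketch below assumes the multiplicative Hullian form described in the text: each reaction potential is a habit strength times the relevant stimulus-intensity dynamisms, with a factor of 10 raised to minus (constant times distance) for every dimension along which the test compound differs from the training compound. Habit strengths are set to unity as in the text; the dynamism values, constants, and function name are hypothetical.

```python
def net_potentials(V11, V21, V22, b, d_S, a, d_B, H=1.0):
    """Net discriminatory reaction potentials for S2B1 and S2B2.

    V11, V21, V22: dynamisms of the compounds S1B1, S2B1, S2B2 (each <= 1).
    b, d_S: generalization constant and distance on the S dimension.
    a, d_B: generalization constant and distance on the B dimension.
    """
    gen_S = 10 ** (-b * d_S)   # generalization across the S dimension
    gen_B = 10 ** (-a * d_B)   # generalization across the B dimension
    net_S2B1 = H * V21 - H * V11 * V21 * gen_S
    net_S2B2 = H * V21 * gen_B * V22 - H * V11 * gen_S * V21 * gen_B * V22
    return net_S2B1, net_S2B2

# Hypothetical values chosen only to illustrate the direction of the result
net_reinforced, net_nonreinforced = net_potentials(
    V11=0.7, V21=0.8, V22=0.9, b=0.15, d_S=0.06, a=0.15, d_B=1.0)

print(net_reinforced, net_nonreinforced)
```

In this form the potential for the nonreinforced S2B2 is the potential for the reinforced S2B1 multiplied by two factors that cannot exceed unity, so it can never come out larger.
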
Consider further a situation where 
the reinforcement is administered only 
in the presence of the stronger ex- 
traneous stimulation, and we are 
concerned with the prediction of per- 
formance to the nonreinforced pairs 
S1B1 and S2B1. Since a decrease in 
extraneous stimulation results in the 
“effective” weakening of S1 and S2 
when they are presented with B1, the 
distance between the pairs S1B2 and 
S2B2 will be smaller than between 
S1B1 and S2B2, and greater than 
between the pairs S1B2 and S2B1. 
Here then we predict that the highest 
percentage of correct responses will 
obtain for the nonreinforced pair 
S1B1. Thus, when reinforcement is 
given only in the presence of B2 the 
net discriminatory reaction poten- 
tials are ordered as follows: 
$_{S_1B_1}\dot{E}_{R_1} > {}_{S_1B_2}\dot{E}_{R_1} = {}_{S_2B_2}\dot{E}_{R_2} > {}_{S_2B_1}\dot{E}_{R_2}$. 
It can be shown, as above, that the Hullian 
model predicts $_{S_1B_1}\dot{E}_{R_1} < {}_{S_1B_2}\dot{E}_{R_1}$. 

Several experiments using the above 
paradigm were recently carried out, 
and their results seem rather con- 
sistent with expectations. In each 
experiment one group of subjects 
were reinforced only in the presence 
of weak, and another only in the 
presence of strong extraneous stimu- 
lation. In all experiments the dis- 
criminanda were presented for a 
1-second duration and simultaneously 
with the extraneous stimuli. In one 
of the experiments (Dorfman, Platz, 
& Zajonc, 1962), mild electric shocks 
(.92 and 1.32 milliampere) served as 
the discriminanda and tones of weak 
and strong intensities as extraneous 
stimuli. In another, two stimuli 
slightly differing in loudness (51 and 
55 decibels, 1,000 cps) were used as 
the discriminanda and two levels 
of illumination provided extraneous 
stimulation. In a third experiment 
two similar levels of brightness of a 
patch of light were used as discrimi- 
nanda, and two levels of auditory 
stimulation served as B1 and B2. 
On each trial one of the four S_iB_j 
combinations occurred, all equally 
often and in a random sequence over 
a series of 128 trials. Always, in one 
group of subjects reinforcement was 
given in the presence of B1 only, and 
in another in the presence of B2 only. 
The percentage of correct responses 
to the S_iB_j pairs found in these three 
experiments is summarized in the 
upper portions of Figure 5. The two 
first experiments clearly show the 
expected pattern of response prob- 
abilities. The results of the third 
experiment are somewhat weaker, 
although the same trend is still to be 
observed. In the second experiment 
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(where auditory stimuli served as the 
discriminanda and illumination as 
the extraneous stimulation, and when 
reinforcement was administered under 
B1) the difference in correct responses 
between the nonreinforced pairs S1B2 
and S2B2 is over 30%, with correct 
responses to S1B2 being only slightly 
better than chance and those to S2B2 
reaching almost 90%. It should be 
noted that the above response meas- 
ures are based always on the entire 
set of trials, including early blocks 
when performance to all four S_iB_j 
combinations does not exceed the 
chance level. 
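
The trial structure common to these experiments can be sketched as follows. The sketch follows the description above (four combinations of discriminandum and extraneous level, equally often, random order, 128 trials, reinforcement under one extraneous level only); the function name, the seed, and the tuple layout are hypothetical.

```python
import random

def build_schedule(reinforced_b, n_trials=128, seed=0):
    """Return a list of (discriminandum, extraneous level, reinforced?)."""
    combos = [(s, b) for s in ("S1", "S2") for b in ("B1", "B2")]
    trials = combos * (n_trials // len(combos))   # 32 of each combination
    random.Random(seed).shuffle(trials)           # random sequence
    # Feedback is given only when the reinforced extraneous level is present
    return [(s, b, b == reinforced_b) for s, b in trials]

# Group reinforced only in the presence of the weaker extraneous stimulation
schedule = build_schedule("B1")
print(len(schedule), sum(flag for _, _, flag in schedule))
```

Calling build_schedule("B2") would give the corresponding schedule for the group reinforced only under the stronger extraneous stimulation.
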

It might be argued that in the 
present series of discrimination ex- 
periments the subjects may have used 
the irrelevant extraneous stimulation 
as an additional, albeit improper, cue. 
That is, for instance a subject who is 
only reinforced in the presence of the 
weaker stimulus, and is asked to 
choose between R1 and R2 when the 
pair S1B2 is presented, might when 
uncertain be tempted to select R2 
rather than the correct R1 simply 
because the extraneous component of 
the pair is clearly strong, and because 
R2 was always reinforced to the 
stronger of the two stimuli, namely S2. 
Performance to S1B2 would therefore 
suffer. When presented with S2B2 
and also uncertain, he might also 
choose R2 in preference to R1 for the 
same reason. This tendency, how- 
ever, would benefit the performance 
to S2B2. The opposite tendencies 
may obtain when reinforcement is 
given only in the presence of B2. 
Here performance to S1B1 would 
benefit and to S2B1 suffer. To ex- 
amine these possibly confounding 
effects, the second of the previous 
discrimination experiments was re- 
peated using the same two auditory 
stimuli as S1 and S2, and the same 
two levels of illumination as B1 and 

Fig. 5. Discrimination performance under extraneous 
stimulation in four experiments. Panels: Experiment I 
(S1 and S2: electric shock; B1 and B2: loudness), 
Experiment II (S1 and S2: loudness; B1 and B2: brightness), 
Experiment III (S1 and S2: brightness; B1 and B2: loudness), 
Experiment IV (S1 and S2: loudness; B1 and B2: brightness). 
Ordinate: percent of correct responses. 


B2, and the following preexperimental 
procedure. Before the experiment 
proper and before introducing S1 and 
S2 at all, the subjects were trained to 
respond to B1 and B2 in the absence 
of the discriminanda. In one group 
(Compatible Reinforcement) the re- 
sponse which was later to be asso- 
ciated with the weaker S1 was rein- 
forced to B1 and the response to be 
later associated with the stronger S2 
to B2. In another group (Incom- 
patible Reinforcement) the response 
to be later associated with the weaker 
S1 was reinforced to B2, and that to 
be later associated with S2 to B1. 
In a third group (Random Rein- 
forcement) half of the R1 responses 
were randomly reinforced to B1 and 
half to B2, and similarly for the R2 
responses. This training was carried 
out for 60 trials. In the second part 
of the experiment a discrimination 
learning procedure identical to that 
in Discrimination Experiment II was 
used. 

In all groups the effect found pre- 
viously was observed. As can be seen 
in the lower portion of Figure 5, 
the pattern of differences between the 
proportions of correct responses is 
most pronounced for the Compatible 
Reinforcement group, showing that 
in the original experiments at least a 
part of the effect may be attributed to 
the tendency of responding on the 
basis of extraneous stimulation when 
a discrimination with respect to S1 
and S2 is difficult. However, the 
presence of the differential response 
percentages to the four S_iB_j pairs, 
consistent with previous findings and 
found for all three groups, strengthens 
the conclusion that the effects pre- 
viously obtained were in large part 
due to a sensory-interaction influence. 


CONCLUSIONS 


The results of the experiments presented above seem to argue for an
increased role of perception in Behavior Theory. In particular they
argue for an increased role of the afferent stimulus interaction
postulate, although the postulate may have to be somewhat modified.
Afferent impulses (s’s) present at a given time do seem to interact
and seem to convert one another. But it is also apparent that, at
least with respect to the continua used in the above research, they
convert one another into š’s which are not qualitatively but
quantitatively different from the original s’s. Consequently, while
the prediction of the fall in reaction potential (Eu) resulting when
the response is elicited by š is correct, one cannot readily extend
this prediction to the generalized reaction potential, Ex. Depending
upon the nature of the afferent interaction, the test stimulus, and
the levels of extraneous stimulation present during training and
testing, Ex may either show a fall or a rise. Exact predictions of Ex
and sER would therefore require exact knowledge of the specific
sensory-interaction effects which obtain for the particular eliciting
and extraneous stimuli under consideration. Unfortunately, there are
at the moment no systematic principles governing such
sensory-interaction effects. For each pair of stimuli specific
previous knowledge of interaction effects must be available.

The afferent stimulus interaction postulate holds specifically that
${}_{\check{s}}E_R = {}_{s}E_R \times 10^{-jd}$, where j is an
empirical generalization constant, and d the difference between s and
š (Hull, 1951, p. 94). The only modification required is the addition
of the qualification that at times a generalization gain obtains when
responses are made to š, and that d be considered in terms of
“effective” or perceived rather than in terms of physical distance.
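
As a rough numerical illustration, the sketch below evaluates the postulate on the assumption that it takes the form šER = sER × 10^(−jd). The function name and the values chosen for sER, j, and d are hypothetical. Letting d take a negative value is only one possible way of picturing a generalization gain in terms of "effective" distance; it is an assumption of this sketch and not part of Hull's statement.

```python
def generalized_reaction_potential(s_e_r: float, j: float, d: float) -> float:
    """Reaction potential evoked by the altered impulse, assuming the form
    E(altered) = E(original) * 10 ** (-j * d)."""
    return s_e_r * 10 ** (-j * d)


original = 5.0  # hypothetical reaction potential to the training stimulus
j = 0.1         # hypothetical generalization constant

# A positive effective distance yields a generalization fall.
fall = generalized_reaction_potential(original, j, d=2.0)
# A negative effective distance (an assumption of this sketch) yields a gain.
gain = generalized_reaction_potential(original, j, d=-2.0)

print(fall < original < gain)
```

With d = 0 the function returns the original reaction potential unchanged, which corresponds to the case of no effective difference between s and š.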

The increased significance of the 
afferent stimulus interaction postu- 
late may have wider implications. 
Perceptual constructs and variables 
have never played important roles in 
Behavior Theory. Among the formal 
principles of Behavior Theory only 
the afferent-stimulus-interaction and 
the stimulus-intensity-dynamism pos- 
tulates might be “perceptual” or 
“quasi-perceptual” in nature. Of 
these two, the former remained largely 
idle, while the latter was shown to be 
in conflict with experimental results 
(Nygaard, 1958), and ultimately un- 
necessary for the explanation of the 
data which prompted its introduction 
(Champion, 1962; Logan, 1954; Per- 
kins, 1953). The idleness of the 
afferent stimulus-interaction postu- 
late placed the eliciting stimulus in a 
rather enviable position among other 
variables. We began to view it as a 
stable and dependable component of 
the system. When experimental vari- 
ables popular in behavioral research, 
and in particular drive, are manipu- 
lated, they are conventionally viewed 
as leaving the eliciting stimulus un- 
touched. Rather, the principal atten- 
tion is directed to how these variables 
affect the output side. Sensory-inter- 
action data indicate that the same 
stimulus administered under two
different background conditions is 
simply not the same stimulus. At 
least, it is not perceived or reacted 
to as if it were the same stimulus. 

A further implication follows from 
the research presented in this paper. 
It is quite well established that the 
strength of learned responses is mark- 
edly influenced by drive (Spence, 
1956). It is also rather well estab- 
lished that with drive level held 
constant, the intensity of the eliciting 
stimulus influences the strength of 
learned responses in very much the 
same way. Estes (1958) made this 
observation in a somewhat different 
context. The former effect has been 
commonly dealt with by means of 
the multiplicative law. The latter 
effect, on the other hand, has been 
originally treated in terms of stimulus- 
intensity dynamism and later by 
means of the discrimination principle 
elaborated by Perkins (1953) and 
Logan (1954). In the light of the 
results presented here, the communal- 
ity of drive and stimulus-intensity 
effects need not remain simply acci- 
dental. The high rate of responding 
under high drive need not be entirely 
due to the energizing force of D. 
These effects might in some degree 
reflect the increased perceived in- 
tensity of the eliciting stimulus, which 
derives from its interaction with the 
drive-stimulus. Although the problem
is more complex for acquisition,
here too the possibility exists that 
stimulus intensity and drive can be 
“traded off” for one another because 
the latter increases the “effective” 
intensity of the eliciting stimulus. 
Thus, the effects of stimulus intensity 
and drive on acquisition and per- 
formance may prove to be equivalent 
not only in their consequences, but in 
part also in their immediate ante- 
cedents. 
The possibility that the drive
stimulus may produce sensory interaction
effects is strengthened by
recent neurophysiological evidence.
On the one hand Stellar (1954)
argued that the state of drive of the 
organism is directly correlated with 
the amount of activity in the hypo- 
thalamus. Hebb (1955) and Lindsley 
(1957) also view general level of 
motivation as reflected in cortical 
arousal. On the other hand, man 


neural activation. As a matter of fact, afferent stimulus interaction
was at first presented under the label of afferent neural interaction.
Fuster (1958), for instance, reports that discrimination is
facilitated by a mild stimulation of the brain stem, and Boswell
(1958) that recognition thresholds are lowered when obtained during
the excitatory phase of the alpha rhythm. Thus, to the extent that
drive, as well as extraneous stimulation, produces increased neural
activity, and to the extent that such arousal interacts with other
sensory inputs, drive, extraneous stimulation, and the eliciting
stimulus intensity might all show equivalent effects.

The equivalent effects of drive and extraneous stimuli on stimulus
reception and transmission may also have some implications for the
work which, since the classical Bruner-Goodman experiment (1947) on
the estimation of coin sizes, has concentrated on demonstrating
motivational effects in perception. Although the role of motivational
influences in perception is constantly being challenged by
response-oriented interpretations, the argument presented here would
certainly support the possibility


such as accentuation of stimulus magnitudes
and the lowering of sensory
[...]tion. But a departure from the New
Look formulation is also indicated. 
Traditionally, the general hypothesis 
that motivational factors influence 
perception requires that in order for 
a motivational state to exert some 
effect upon the perception of a 
stimulus, the stimulus must in some 
relevant, perhaps instrumental, way 
be related to the manipulated drive. 
The above restriction, which is essential to the New Look
interpretation of motivational effects in perception, may be quite
unnecessary. Any drive, provided its SD interacts with the judged
stimulus, may exercise such effects as magnitude over- or
underestimation and changes in recognition threshold. This latter
thought is perhaps less enchanting from a clinical point of view, but
on the other hand, there is little reason to doubt it.
Differential perceptual performance 
associated with personality differ- 
ences need not be interpreted in terms 
of the differential psychological rela- 
tion of the different personalities to 
the stimulus object. Rather, it can 
be viewed as a consequence of differ- 
ences in arousal, drive, or sensitivity 
to sensory-interaction effects. It is 
still possible that when the judged 
stimulus has instrumental significance 
for the motivational state in question, 
some special effects might be obtained. 
It is more likely, however, that these 
effects would be limited to influences 
upon processes peripheral to what 
might be called “immaculate” per- 
ception, such as orienting, attending, 
and the like. 
A STRUCTURAL BALANCE MODEL OF COMMUNICATION
EFFECTS¹

A communication model is presented in which the source, the
communication, the issue, and the receiver are conceived as elements
of a communication structure and attitudinal and unit relations
between these elements are investigated. This basic communication
structure is represented in terms of the mathematical theory of linear
graphs, and Heider's principle of structural balance is invoked to
specify balanced and unbalanced structures. The model is applied to
interpretation of results of a sample of studies from different areas
and is shown to have considerable power and generality in interpreting
communication effects. Future avenues of developing the model are
noted involving conceptualization of: (a) strength of relations, (b)
importance of issue, (c) specific effects of imbalance, and (d)
individual differences in intolerance of imbalance.


There is now a considerable body of 
evidence in the psychological litera- 
ture to support the conclusion that a 
person tends to evaluate communica- 
tions in a manner consistent with his 
relevant attitudes. If a communica- 
tion about some issue is consistent with 
a person’s attitude towards that issue 
it is more likely to be accepted than 
rejected. The program of research 
in attitude and communication by Hov- 
land and his colleagues (Hovland, 
Janis, & Kelley, 1953; Hovland & 
Rosenberg, 1960; Sherif & Hovland, 
1961) and the investigations generated 
by Festinger’s (1957) theory of cog- 
nitive dissonance, provide evidence 
which is in general agreement with 
this conclusion. Recent studies concerned with the evaluation of
evidence about the cigarette smoking-lung cancer relationship
(Feather, 1963) and with judgments of the soundness or unsoundness of
religious arguments (Feather, 1964) are also consistent with this
conclusion.

¹ Sections of this paper were written when the author was Visiting
Scientist and Research Associate at the University of Michigan under
the sponsorship of the Institute of Science and Technology and
National Science Foundation Project GS-9.

Theoretical models have been presented which permit interpretation of
this tendency for consistency between evaluation and attitude.² Such a
tendency could, for example, be interpreted as one way of reducing
cognitive dissonance (Brehm & Cohen, 1962; Festinger, 1957) or in
terms of latitudes of acceptance or rejection (Sherif & Hovland,
1961). The present paper explores another possible approach which is
based on Heider's (1946, 1958) theory of balanced structures and
which, in particular, draws upon the reformulation of this theory by
Cartwright and Harary (1956), using concepts from the mathematical
theory of linear graphs. This approach therefore considers the
relationship of evaluation to attitude in terms of a structural
balance model. Such a model also has implications about mutual
attitudes between the source and the receiver of the communication
which may be reflected in judgments about credibility. It also
provides an analysis of the conditions under which one might expect
attitude change to occur. It may therefore be viewed as a relatively
general model of communication effects. We will now turn to a
statement of this model and then to an examination of some
experimental evidence pertaining to it.

² The main alternative models are discussed in recent papers by Osgood
(1960), Zajonc (1960), and Brown (1962).


COMMUNICATION STRUCTURE 


The communication structure with 
which we are concerned consists of 
the following elements: a source (S), 
a receiver (R), a communication (C), 
and an issue (I). The source is de- 
livering a communication about an is- 
sue to a receiver. The communication 
may be verbally presented or written, 
but it takes some position on the issue. 
This is the basic communication struc- 
ture to be considered and is an ab- 
straction from such diverse situations 
as a lawyer presenting an argument 
about the innocence of his client to a 
juryman, a minister preaching about 
religion to a member of his congrega- 
tion, a newspaper editor writing about 
racial integration for a reader, a thera- 
pist giving his interpretation of a par- 
ticular symptom to a client, a politician 
outlining a proposal of agricultural 
policy to a member of his electorate, 
and a teacher presenting a lesson on 
the American Civil War to a pupil. 

It is assumed that we know certain attitudinal relations between
elements of the communication structure. Specifically, we assume that
the attitudes of the source, the communication, and the receiver
towards the issue are known. These attitudinal relations are either
positive or negative. At present no allowance is made in the model for
variation in the strength of the attitude. Only the sign of the
attitude is considered. The various situations presented above may be
expanded to take account of the possible attitudinal relations. For
example, the newspaper editor may be in favor of racial integration,
his editorial may advocate racial integration, and his reader may be
opposed to racial integration. The politician may be personally
opposed to a change in agricultural policy, his speech may, in fact,
advocate a change in agricultural policy, and his audience may favor a
change in agricultural policy.

The elements and relations in the 
above communication structure are co- 
ordinated to concepts from the theory 
of linear graphs (Cartwright & Har- 
ary, 1956). In particular, the ele- 
ments are coordinated to points of the 
graph, and the relations to directed 
lines of the graph. The sign of the 
relation is represented by the sign of 
the directed line. The resulting graph 
is called a signed digraph or s-digraph. 
For the communication structure un- 
der discussion, there are eight distinct 
s-digraphs which include all possible 
combinations of positive and negative 
attitudinal relations. These eight s- 
digraphs are presented in Figure 1 
employing the convention that solid 
lines are positive and dashed lines 
negative. The arrows on the lines 
of each s-digraph indicate direc- 
tion and represent the asymmetry of 
the attitudinal relations in that source, 
communication, and receiver are all 
assumed to have attitudes towards the 
issue, but the issue cannot have atti- 
tudes towards them. 
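
The count of eight follows from giving each of the three directed lines (source to issue, communication to issue, receiver to issue) one of two signs. A minimal sketch of that enumeration is given below; the dictionary keys and the function name are hypothetical labels, and the order in which the structures are generated is not meant to reproduce the lettering a to h used in Figure 1.

```python
from itertools import product

POSITIVE, NEGATIVE = +1, -1


def attitude_structures():
    """Every assignment of a sign to the lines S->I, C->I, and R->I."""
    return [
        {"SI": s, "CI": c, "RI": r}
        for s, c, r in product((POSITIVE, NEGATIVE), repeat=3)
    ]


structures = attitude_structures()
print(len(structures))  # number of distinct sign combinations
for structure in structures:
    print(structure)
```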

In Figure 1 we represent at one extreme, in s-digraph a, the structure
in which source, communication, and receiver are all in favor of the
issue. The positive line SI means that the source of the communication
has a positive attitude or is favorably disposed towards the issue.
Correspondingly, the positive lines RI and CI mean that both receiver
and communication are favorably disposed towards the issue. The
structure in s-digraph a could, for example, represent the situation
in which a minister with a positive attitude towards religion delivers
a sermon favoring religion to a member of his congregation who favors
religion. At the other extreme, we represent in s-digraph h the
structure in which source, communication, and receiver are all opposed
to the issue. The negative lines SI, RI, and CI represent these
uniformly negative attitudinal relations. The structure in s-digraph h
could, for example, represent the situation in which a conservative
politician opposed to socialized medicine argues against socialized
medicine to an audience also opposed to this policy. Between the
extreme structures of s-digraph a and s-digraph h are intermediate
structures involving both positive and negative attitudinal relations.

Fig. 1. Signed digraphs representing the eight possible combinations
of positive and negative attitudinal relations of source (S), receiver
(R), and communication (C) to issue (I).


STRUCTURAL BALANCE 


Each of the eight structures in Fig- 
ure 1 represents, in the form of an 
s-digraph, three attitudinal relations 
between elements of the structure, viz., 
source to issue, communication to is- 
sue, and receiver to issue. The atti- 
tudinal relations of source to communi- 
cation, of receiver to communication, 
and of source to receiver (and vice 
versa) are unknown. It is assumed 
that these relations may be predicted 
by applying Heider’s (1958) principle 
of balance. 

Heider proposes this principle in an analysis of structures involving
two or more of the following entities: P (one person), O (another
person), and X (an impersonal entity). The entities may be related
attitudinally (e.g., P likes X, O does not like P), or in terms of a
unit relationship (e.g., P owns X, P and O live together). Heider
(1958), reflecting the general field-theoretical approach, assumes
that relations within the P-O-X unit are interdependent and that this
interdependence is governed by a principle of balance. He states the
conditions for balance as follows:

A dyad is balanced if the relations between the two entities are all
positive or all negative... Disharmony results when relations of
different sign character exist.

A triad is balanced when all three of the relations are positive or
when two of the relations are negative and one is positive. Imbalance
occurs when two of the relations are positive and one is negative
[p. 202].


Heider’s basic hypothesis therefore asserts that attitudinal
(sentiment) relations and unit relations tend towards a balanced
state.³ Such states of balance tend to be preferred over disharmony.
Existing relations may induce new relations in the direction of a
balanced state, and when a structure is unbalanced pressures will
arise towards balance. Transformation from a state of imbalance to a
state of balance may occur by means of changes either in the
attitudinal relations or the unit relations or both. If such changes
are not possible the state of imbalance will produce tension.

Cartwright and Harary (1956) reformulate the concept of a balanced
structure using the mathematical theory of linear graphs and, in so
doing, are able to resolve some ambiguities in Heider’s (1946)
presentation and to generalize the principle to more complex
structures. In particular, they assume that an s-digraph is completely
balanced if all its semicycles are positive. A semicycle is defined as
“a collection of lines obtained by taking exactly one from each pair
AB or BA, BC or CB, . . . , DE or ED, and EA or AE [Cartwright &
Harary, 1956, p. 283].” The sign of a semicycle is the product of the
signs of its lines. Figure 2 presents the eight balanced s-digraphs
which correspond to balancing the eight structures represented in
Figure 1.

³ Cartwright and Harary (1956, p. 278) provide the following examples
of units balanced according to the above conditions: P likes something
he made; P likes what his friend likes; P dislikes what his friend
dislikes; P likes what his enemy dislikes; and P’s son likes what P
likes.

All semicycles are positive in each
of the s-digraphs in Figure 2. If we
consider the semicycle SI, IC, CS,
for example, we note that the product
of the signs of its lines is positive in 
each of the eight s-digraphs. Simi- 
larly, by inspection it can be seen that 
all other semicycles in the s-digraphs 
are positive. Therefore all of the s- 
digraphs in Figure 2 are completely 
balanced. 
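
The inspection described above can be sketched in code. The example below enumerates the semicycles of a small signed digraph and multiplies the signs of their lines; it is an illustrative brute-force check under stated assumptions, not Cartwright and Harary's own procedure. The representation of lines as a dictionary keyed by ordered pairs, the function names, and the two example structures are hypothetical. The all-positive structure is the balanced form of s-digraph a as inferred from the balance condition, not read off Figure 2.

```python
from itertools import combinations, permutations, product
from math import prod


def semicycle_signs(lines):
    """Return the sign of every semicycle of a signed digraph.

    `lines` maps an ordered pair (u, v) to +1 or -1.
    """
    points = sorted({p for pair in lines for p in pair})
    signs = []
    # Two-point semicycles consist of both directed lines between two points.
    for u, v in combinations(points, 2):
        if (u, v) in lines and (v, u) in lines:
            signs.append(lines[(u, v)] * lines[(v, u)])
    # Longer semicycles: each cyclic ordering of three or more points.
    for size in range(3, len(points) + 1):
        for subset in combinations(points, size):
            first, rest = subset[0], subset[1:]
            for order in permutations(rest):
                if order[0] > order[-1]:
                    continue  # mirror image of an ordering already visited
                cycle = (first,) + order
                pairs = zip(cycle, cycle[1:] + cycle[:1])
                # For each adjacent pair take one line, in either direction.
                choices = [
                    [lines[e] for e in ((u, v), (v, u)) if e in lines]
                    for u, v in pairs
                ]
                signs.extend(prod(pick) for pick in product(*choices))
    return signs


def is_balanced(lines):
    """Completely balanced: every semicycle has a positive sign."""
    return all(sign > 0 for sign in semicycle_signs(lines))


all_positive = {
    pair: +1
    for pair in [("S", "I"), ("C", "I"), ("R", "I"),
                 ("S", "C"), ("R", "C"), ("S", "R"), ("R", "S")]
}
one_flipped = dict(all_positive)
one_flipped[("R", "C")] = -1  # receiver now rejects a communication he agrees with

print(is_balanced(all_positive), is_balanced(one_flipped))
```

Reversing the sign of a single line is enough to make the semicycle RI, IC, CR negative, which is why the second structure fails the check.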

In each s-digraph the S-I-R unit 
involves three semicycles: SR, RS;
SR, RI, IS; and RS, RI, IS. This
unit corresponds to the P-O-X unit of 
Heider for the special case where one 
person is the source of a communica- 
tion, the other person is the receiver 
of the communication, and the imper- 
sonal entity is the issue which the com- 
munication concerns. It will be noted 
that symmetric relations are permitted 
between points S and R, i.e., the source 
and the receiver may have attitudes to- 
wards one another. The relations be- 
tween points S and C, and R and C 
are, however, asymmetric since it is as- 
sumed that while both source and 
receiver can have attitudes towards 
the communication, the communica- 
tion does not have attitudes towards 
them. 

The structures represented in Figure 2 permit predictions about
attitudinal relations of source to communication, receiver to
communication, and source to receiver (and vice versa), given the
information contained in the s-digraphs of Figure 1 and the assumption
of structural balance. Thus, knowing the attitudes of source,
communication, and receiver towards the issue with which the
communication is concerned, it is possible, by employing the balanced
s-digraphs in Figure 2, to predict four attitudinal relations: the
attitude of source to communication, the attitude of receiver to
communication, the attitude of source to receiver, and the attitude of
receiver to source. These predictions concern the sign of the attitude
(whether it is positive or negative) but not its strength. We assume
that the attitude of source or receiver to the communication is
reflected in the way he evaluates the communication, i.e., whether he
accepts or rejects it, agrees or disagrees with it, is convinced or
not convinced by it. Further, we assume that the attitude of source to
receiver and vice versa may be reflected in the evaluation made of his
credibility, i.e., whether he is perceived as well informed,
trustworthy, etc.

Fig. 2. Signed digraphs in which the eight s-digraphs of Figure 1 are
completely balanced for attitudinal relations according to Heider's
principle.

For example, let us consider the 
case of the newspaper editor in favor 
of racial integration whose editorial 
advocating racial integration is read 
by a person opposed to this policy. 
We assume also that the editor knows 
the reader's attitude to racial integra- 
tion and vice versa. This communica- 
tion structure is represented in Figure 
1 by s-digraph c, and predictions 
based on the structural balance model 
may be obtained by referring to the 
corresponding balanced s-digraph c in 
Figure 2. There it is apparent that the 
directed line SC is positive, and that 
the directed lines RC, SR, RS are all 
negative. We would therefore predict 
that, for this particular structure, the 
source will be positively disposed towards the communication he is
sending, the receiver will be negatively disposed towards the
communication, and
source and receiver will be negatively 
disposed towards one another. Assum- 
ing that these attitudes will be reflected 
in evaluation of the communication and 
judgments of credibility, we would 
predict that the newspaper editor will 
agree with his editorial, his reader will 
tend to disagree with the editorial, 
finding it unconvincing, and neither 
editor nor reader will regard the other 
as well informed about the issue. 
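
The editor example can be worked through mechanically. Requiring each three-line semicycle to be positive means that an unknown line must carry the product of the signs of the two known lines that complete its triangle. The sketch below applies that rule; the function name and the dictionary keys are hypothetical, and the rule is a consequence drawn from the balance condition for illustration, not a procedure stated in the text.

```python
def predict_relations(si: int, ci: int, ri: int) -> dict:
    """Predict the four unknown attitudinal relations from the three
    known attitudes towards the issue (each given as +1 or -1)."""
    return {
        "SC": si * ci,  # semicycle SI, IC, CS must be positive
        "RC": ri * ci,  # semicycle RI, IC, CR must be positive
        "SR": si * ri,  # semicycle SR, RI, IS must be positive
        "RS": ri * si,  # semicycle RS, SI, IR must be positive
    }


# Editor favors integration, editorial favors it, reader opposes it.
editor_case = predict_relations(si=+1, ci=+1, ri=-1)
print(editor_case)
```

The result agrees with the prediction described above: the line SC is positive and the lines RC, SR, and RS are negative.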

In summary, given information 
about the attitudes of source, commu- 
nication, and receiver towards an issue, 
the structural balance model permits 
prediction of the attitudes of source 
and receiver towards the communication,
and the attitudes of source and re- 
ceiver towards one another. These 
attitudes are assumed to be reflected 
in evaluation of the communication and 
in judgments of the credibility of the 
source or the receiver. 


EXTENDED MODEL 


To this stage we have limited our 
statement of the model to the particu- 
lar case where attitudinal relations of 
source, communication, and receiver 
towards the issue are specified and the 
attitudinal relations of source and re- 
ceiver to communication, and source 
and receiver towards each other, are 
predicted. But it should be apparent 
that the type of approach described 
above can be extended to deal with 
cases where other sets of attitudinal 
relations between elements of the com- 
munication structure are specified and 
the remaining attitudinal relations are 
predicted using the principle of structural balance. For example,
given the attitudinal relations of receiver to source, receiver to
issue, and communication to issue we can use Figure 2 to predict the
attitudinal relations of receiver to communication, source to issue,
source to receiver and source to communication. In short, the set of
attitudinal relations represented in Figure 1 is only one among
several possible sets from which one could predict the balanced
attitudinal relations in Figure 2.

Even allowing for this greater flexibility, however, the model so far
developed is oversimplified in that it has not taken account of unit
relations within the communication structure. As we have noted, Heider
(1958) considers both attitudinal and unit relations in his theory of
balance, and Cartwright and Harary (1956) are able to represent both
types of relations using the mathematical theory of linear graphs.
Heider (1958) defines unit formation as follows: “separate entities
comprise a unit when they are perceived as belonging together. For
example, members of a family are seen as a unit; a person and his deed
belong together [p. 176].” Cognitive unit formation would refer to
such specific relations between entities as similarity, proximity,
ownership, causality, or belonging.

In the present context, we will consider unit relations between source
and communication, between receiver and communication, and between
source and receiver.⁴ Figure 3 presents eight balanced s-digraphs in
which both attitudinal and unit relations are represented. A positive
unit relation is represented by a solid brace; a negative unit
relation is represented by a dashed brace. Following Heider we assume
that a positive unit relation implies unit formation and that a
negative unit relation implies a resistance to unit formation.

⁴ Unit relations of source to issue, receiver to issue, and
communication to issue could also be included in Figure 3 but have not
been so included since we are not sure just what these unit relations
might represent. Positive unit relations of source and receiver to the
issue might be represented in a sense of “ownership” of the issue, or
appropriateness of the issue to the person. A positive unit relation
of communication to issue might be reflected in a judgment that the
communication is pertinent or relevant to the issue.

The s-digraphs in Figure 3 are of Type 2 (Cartwright & Harary, 1956),
since two different relations (unit and attitudinal) are defined on
the same set of elements. In contrast, the s-digraphs in Figures 1 and
2 are of Type 1. Following Cartwright and Harary (1956, p. 285) we
define an s-digraph of Type 2 as balanced if all its semicycles are
positive.⁵ All semicycles are positive in each of the s-digraphs in
Figure 3. For example, if we consider the semicycle consisting of RI,
IC, and the unit relation CR in each of the eight s-digraphs, we note
that in all cases the product of the signs of its lines is positive.
Similarly, by inspection it can be seen that all other semicycles in
the s-digraphs are positive, irrespective of whether they involve only
attitudinal relations, unit relations, or both types of relation.
Therefore, all of the s-digraphs in Figure 3 are completely balanced.

⁵ Other assumptions are possible (e.g., Morrissette, 1958).

Fig. 3. Signed digraphs representing the eight communication
structures in which all

The model presented in Figure 3 has considerable predictive power. In
fact, it is possible to predict all the relations represented in each
s-digraph in Figure 3 provided three relations are known which involve
the four elements (S, C, I, R) of the communication structure. For
example, knowing the attitudinal relation of S to I, the attitudinal
relation of R to I, and the unit relation of S to C, one can predict
all other relations within the communication structure. It is obvious,
however, that an adequate test of the above model will depend upon the
possibility of coordinating the theoretical concepts to empirical
operations. In particular, it is necessary to have some operational
specification of both unit and attitudinal relations for the
communication situation investigated. How are these relations varied
in an experiment and how are changes in these relations reflected in
overt behavior?
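
The claim that three known relations suffice can be illustrated with a short sketch. It assumes complete balance and, as a simplification, treats the attitudinal and unit relations between a given pair of elements as sharing one sign (which the triad condition implies when both are present). Each element is then placed in one of two camps by spreading signs outward from the known relations, and every remaining relation is the product of the camps of its two end points. This two-camp device is a convenience introduced here for illustration; the function names and the sample signs are hypothetical.

```python
from collections import deque
from itertools import combinations


def camps(known, anchor):
    """Assign +1 or -1 to each element reachable from `anchor`.

    `known` maps a pair (u, v) to the sign (+1 or -1) of their relation.
    """
    neighbours = {}
    for (u, v), sign in known.items():
        neighbours.setdefault(u, []).append((v, sign))
        neighbours.setdefault(v, []).append((u, sign))
    values = {anchor: +1}
    queue = deque([anchor])
    while queue:
        u = queue.popleft()
        for v, sign in neighbours.get(u, []):
            implied = values[u] * sign
            if v not in values:
                values[v] = implied
                queue.append(v)
            elif values[v] != implied:
                raise ValueError("known relations are themselves unbalanced")
    return values


def predict_all(known, elements=("S", "C", "I", "R")):
    """Sign of the relation between every pair of elements under balance."""
    values = camps(known, anchor=elements[0])
    missing = [e for e in elements if e not in values]
    if missing:
        raise ValueError(f"known relations do not reach: {missing}")
    return {(u, v): values[u] * values[v] for u, v in combinations(elements, 2)}


# Hypothetical signs: S favors I, R opposes I, S forms a unit with C.
known = {("S", "I"): +1, ("R", "I"): -1, ("S", "C"): +1}
predicted = predict_all(known)
print(predicted)
```

The three known relations must link all four elements; if one element is left unconnected the sketch raises an error, since nothing can then be predicted about it.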

We believe that it is possible to vary 
these relations experimentally and to 
observe the effect of such variation in 
overt behavior. With respect to atti- 
tudinal relations, for example, it would 
be possible in an experiment to vary 
the attitudinal relation of source and 
receiver towards the controversial is- 
sue of civil rights by deliberately se- 
lecting a liberal Democrat to act as 
source and a member of the Ku Klux 
Klan to act as receiver. The attitude 
of the communication towards the is- 
sue could be varied by using a strong 
pro-civil rights message in one situa- 
tion and an anti-civil rights message in 
another. If we were predicting the 
attitudinal relations of source and re- 
ceiver towards the communication and 
the attitudinal relations of source and 
receiver towards one another, we could 
observe the way each evaluates the 
communication (e.g., acceptance versus rejection) and the evaluative
reactions (positive or negative) of one
toward the other. 

Similarly, one could operationally 
specify a positive unit relation between 
source and receiver by selecting sub- 
jects who are, for example, members 
of a strong cohesive group. A nega- 
tive unit relation could be specified by 
choosing as source and receiver sub- 
jects who are members of rival factions. 

The coordination of unit relations 
between source and communication and 
between receiver and communication to 
experimental operations is less obvious 
but one possibility will be discussed in 
some detail because of its importance 
in the analysis of communication effects. Within an experiment it is
possible to vary the degree to which a source considers himself
responsible
for sending a communication, and the 
degree to which a receiver considers 
himself responsible for exposing him- 
self to a communication. When, for 
example, a source is not coerced in any 
way to present a communication but 
performs the activity voluntarily, “of 
his own free will and accord,” then he 
may be regarded as responsible for the 
communication. He has ownership 
over it. When, however, the source 
is coerced to present the communica- 
tion by external forces such as promise 
of reward for compliance or threat of 
punishment for noncompliance, when 
he presents the communication “against 
his will,” then he may be regarded as 
not responsible for the communication. 
We assume that responsibility of a
source for a communication may be 
represented by a positive unit relation. 
A negative unit relation would imply 
that the source is not responsible for 
the communication and, in fact, dis- 
owns it. 

The concept of responsibility has 
some antecedents in the psychological 
literature. In the particular context of 
achievement behavior, for example, a 
related variable is the degree to which 
a person believes that he personally 
controls his success or failure, the de- 
gree to which he considers that suc- 
cess or failure is due to his own skill 
and efforts and not to the operation 
of chance or external agencies (cf. 
Feather, 1959). Similarly, in the com- 
munication situation the source’s re- 
sponsibility would depend upon the 
degree to which he is able to control 
both his acceptance or rejection of the 
role of source and his continuance in 
that role. Where external forces are 
such that they primarily determine his 
role as source then he may not feel
responsible for the communication. 
Similar ideas are presented in a recent 
publication by Brehm and Cohen 
(1962) where they suggest that an
important factor that may control the 
magnitude of dissonance in a situation 
is the degree of volition involved. 
They define degree of volition as: 

the extent to which a person feels that he
controls his own behavior. . . . Volition
implies not only initiation and selection of
behavior but also responsibility for its con-
sequences [p. 201].


Similarly, one can represent a re- 
ceiver’s responsibility for exposing 
himself to a communication by a unit 
relation. A positive unit relation 
would represent the case where the 
receiver has actively and voluntarily 
exposed himself to a communication. 
For example, a person of his own ac- 
cord attends a lecture advocating the 
virtues of Communism. In contrast, 
a negative unit relation would repre- 
sent the case where the receiver is 
coerced by external forces to receive 
the communication. For example, a 
captured soldier is forced by threat of 
punishment to listen to a lecture extol-
ling the virtues of Communism. Many
communication studies in psychology 
involve some degree of coercion since 
the receiver of a communication is 
often paid to participate as a subject. 
Volunteer subjects, knowing the con- 
ditions of the experiment and not in- 
duced to participate for monetary re- 
ward, would presumably feel more 
responsible for their commitment to 
the receiver’s role. 

The examples given above of possi- 
ble ways of operationally specifying 
attitudinal and unit relations within a 
communication situation are not meant 
to be exhaustive. These relations can 
probably be specified in a variety of
ways and it is an empirical problem to
determine which operations best fit the 
model. For example, in order to in- 
duce a positive attitudinal relation be- 
tween receiver and source, one might 
introduce the source as a very warm
and likable person, or as a trust-
worthy and well-informed authority.
In line with the above discussion, one
might represent by a positive unit rela- 
tion the situation where a receiver has 
voluntarily exposed himself to a par- 
ticular source. One might consider 
“assimilation” and “contrast” effects in 
a receiver’s reactions to a communica- 
tion as reflecting positive and negative 
unit relations, respectively. Many of 
these operations have already been in- 
vestigated in studies of communication 
and attitude change.6

We will now consider a selection of 
these experimental studies, represent- 
ing different theoretical approaches, to 
show how they may be interpreted in 
terms of the structural balance model 
developed above. 


SOME EXPERIMENTAL STUDIES


The studies to be discussed are clas- 
sified as follows: (a) Studies in which 
no deliberate attempt is made to induce 
imbalance in the communication struc- 
ture; (b) Studies in which there is a 
deliberate attempt to induce imbalance 
in the communication structure. Both 
sets of studies are relevant to the 
validity of the structural balance model. 


Studies without Induced Imbalance 


In this section we will consider three
investigations: one of our own studies
concerned with the evaluation of a 
communication, a study by Kelman 
and Hovland (1953) concerned with 
the effects of variations in source credi- 
bility, and a study by Hovland, Har- 
vey, and Sherif (1957) concerned with 
assimilation and contrast effects in re- 
actions to communication. 

6 Newcomb’s (1959, 1961) work using the
A-B-X model is of particular interest in
indicating how balanced structures may de-
velop over a period of time.

Feather. This study was designed
to investigate attitudinal relations in-
volving source, receiver, and commu-
nication, given the attitudinal relations 
represented in Figure 1. Subjects 
were involved in a role playing situa- 
tion in which they were required to 
identify with either the source or the 
receiver of a communication about a 
controversial issue and to predict his 
reactions. Each subject was given in- 
formation about the attitudes of the 
source, the receiver, and the commu- 
nication towards the issue. This in- 
formation was presented in the eight 
possible combinations of attitudinal re- 
lations represented in the s-digraphs of 
Figure 1, using a random order of 
presentation. All subjects were re- 
quired to predict whether the source 
and receiver would tend to agree or 
disagree with the communication. Sub- 
jects identifying with the source were 
required to predict whether the source 
would tend to regard the receiver as 
well informed or not well informed 
about the issue. Subjects identifying 
with the receiver were required to pre- 
dict whether the receiver would tend 
to regard the source as well informed 
or not well informed about the issue. 
Thus, the attitudinal relations of 
source and receiver towards the com- 
munication were assumed to be re- 
flected in their evaluation of the com- 
munication. The attitudinal relations 
of source and receiver towards one an- 
other were assumed to be reflected in 
judgments of credibility of source or 
receiver. 

The results of this experiment pro- 
vided strong support for the structural 
balance model. They indicated that a 
person’s evaluation of a communication 
will tend to be positive when his atti- 
tude towards the issue is of the same 
sign as the communication’s attitude 
towards the issue, and negative when 
these attitudes are of opposite sign. 
Similarly, the results showed that a 
source (or receiver) will tend to judge
a receiver (or source) as well informed
about an issue when he knows that they 
both have attitudes towards the issue 
which are of the same sign, and not 
well informed when he knows that 
they both have attitudes towards the 
issue which are of opposite sign. The 
results also indicated that a person’s 
judgments of credibility are likely to 
be relatively less balanced than are his 
evaluations of the communication. We 
believe that this relative difference re- 
flects the fact that the attitudinal rela- 
tion of source to receiver (or vice 
versa) may be manifested in a variety 
of ways depending on the situation. 
For example, a receiver may have the 
same attitude as the source towards a 
certain issue but he may have reason 
to doubt that the source’s attitude is 
based on reliable evidence. In this 
case, the predicted positive attitudinal 
relation of receiver to source may be
manifested in some form other than a
judgment that the source is credible
(e.g., in a certain liking or sympathy
for the source, a positive attraction for 
the source despite the realization that 
he may not be well informed). Such 
alternative manifestations of an attitu- 
dinal relation are probably less likely 
where the relation of person to com- 
munication is concerned. Here the 
relation is more likely to be expressed 
in terms of agreement or disagreement 
with the communication. 

Kelman and Hovland (1953). We 
will limit our discussion of this study 
to that aspect concerned with the ef- 
fects of variation in source credibility 
on evaluations of a communication. 
High school students listened to a re-
cording of an educational program in
the course of which a guest speaker 
gave a talk favoring lenient treatment 
of juvenile delinquents. To vary the
credibility of the source the speaker
was introduced in three different ways.
In the positive credibility condition he
was identified as the presiding judge
in a juvenile court, author of several
books on delinquency, etc. In the neu-
tral credibility condition he was intro-
duced as a member of the studio 
audience, chosen at random. In the 
negative credibility condition he was 
presented as a person with a shady 
past, a near-delinquent himself, out on 
bail on a dope-peddling charge, etc. 
Among questions asked of subjects 
after the speech was the following: 
“Was the guest’s presentation fair or 
was it one-sided?” The results indi- 
cated that the percentage of subjects 
who thought the communicator’s pres- 
entation was “completely fair” or 
“fair” decreased substantially as the 
credibility of the communicator de-
creased.

The results may be understood in 
terms of the two balanced s-digraphs 
in Figure 4. In s-digraph a the posi- 
tive attitudinal relation of receiver to 
source represents the positive credi- 
bility of the source, the positive unit 
relation of source to communication 
assumes that the source is seen as re- 
sponsible for the communication, and 
the positive attitudinal relation of re- 
ceiver to communication balances the 
S-C-R triad. The latter attitudinal re- 
lation is assumed to be reflected in the 
subject’s judgment that the communi- 
cation has been fairly presented. In a 
corresponding way, the negative atti- 
tudinal relation of receiver to commu- 
nication in s-digraph b balances the 
S-C-R triad, which represents the case 
in which a negatively credible source is 
regarded as responsible for a one-sided 
communication. 
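
The balancing rule applied in this interpretation can be sketched computationally. The following is a minimal illustration and not part of the original analysis: it assumes that each relation is coded as +1 or -1 and that a triad is balanced when the product of its three signs is positive. The function name and the coding scheme are hypothetical.

```python
def balancing_sign(first: int, second: int) -> int:
    """Return the sign the third relation of a triad must take for balance.

    Assumes relations are coded +1 (positive) or -1 (negative) and that a
    triad is balanced when the product of its three signs is positive.
    """
    for sign in (first, second):
        if sign not in (1, -1):
            raise ValueError("relations must be coded as +1 or -1")
    return first * second


# S-digraph a: credible source (receiver to source positive) who is
# responsible for the communication (positive unit relation).
receiver_to_communication_a = balancing_sign(+1, +1)

# S-digraph b: negatively credible source, still responsible for the
# communication.
receiver_to_communication_b = balancing_sign(-1, +1)

print(receiver_to_communication_a, receiver_to_communication_b)
```

Under these assumptions the first case calls for a positive evaluation of the communication and the second for a negative one, which is the pattern described for the two s-digraphs in Figure 4.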

Hovland, Harvey, and Sherif (1957). 
In this investigation each subject was 
first required to indicate his stand on 
a controversial issue (prohibition and 
repeal in a “dry” state) by checking 
which of nine representative statements 
about the issue best corresponded to 

FIG. 4. Balanced s-digraphs representing
the effect of variation in source credibility
on receiver’s evaluation of communication.


his own position. These statements 
about the issue ranged from an ex- 
treme “dry” stand to an extreme “wet” 
stand. Each subject also checked those 
statements which he did not find ob- 
jectionable from his point of view, 
those statements which he did find ob- 
jectionable, and those statements which 
were neither acceptable nor unaccept- 
able to him. This procedure provided 
information about his latitude of ac- 
ceptance (the range of statements tol- 
erated) and his latitude of rejection 
(the range of statements rejected). 
Subsequently a wet (repeal) commu- 
nication was presented to extreme dry 
subjects and unselected subjects, a dry 
(prohibition) communication was pre- 
sented to extreme wet and unselected 
subjects, and a moderate communica- 
tion was presented to wet, dry, and 
unselected subjects. Following the 
communication the nine statements 
were again presented to each subject 
to secure his stand, his latitude of 
acceptance, and his latitude of rejection 
concerning the issue. His reactions to 
the communication were also obtained 
through ratings on like-dislike, reason- 
able-unreasonable, biased-unbiased, and 
propaganda-fact dimensions. Subjects 
who received the moderate communica- 
tion also checked on a rating scale 
ranging from extreme dry to extreme 
wet positions what they thought to be 


FIG. 5. Balanced s-digraphs representing
cases where communication is within re- 
ceiver’s latitude of acceptance (s-digraphs 
a and b) and within his latitude of rejection 
(s-digraphs c and d). 


the stand taken in the communication 
itself. 

The results indicated that there was 
an extremely close relationship be- 
tween the person’s own stand concern- 
ing the issue and his evaluation of 
the communication. Hovland, Harvey, 
and Sherif (1957) conclude that, 


The data provide quantitative information
. . . that individuals who are in favor of the
opinion advocated will consider the commu- 
nication fair and unbiased, but that those 
with an opposed stand will regard an iden- 
tical communication as propagandistic and 
unfair [p. 247]. 


In discussion of this result they sug- 
gest that when a communication falls 
within the latitude of acceptance it will 
tend to be judged as fair and unbiased. 
As it is perceived as moving more and 
more into the latitude of rejection there 
is an increasing tendency for it to be 
considered unfair and propagandistic. 
The results of this investigation also
showed that subjects with extreme
stands on the issue tended to perceive
the moderate communication as fur-
ther removed from their own stand
than it actually was (a “contrast”
effect). Subjects with stands close to
the position advocated in the moderate
communication tended to perceive the
communication as closer to their own
stand than was actually the case (an
“assimilation” effect).

The results of this study may be 
interpreted in terms of the balanced 
s-digraphs in Figure 5. Since no in-
formation is provided in the study
about the credibility of the source, no
relation is introduced between S and R.
We assume that a communication falls 
within a person’s latitude of acceptance 
when the attitudinal relations of both 
receiver and communication to the is- 
sue are of the same sign, and within 
a person’s latitude of rejection when 
these relations are of opposite sign. 
Considering attitudinal relations, Fig-
ure 5 shows that the R-I-C triad would
be balanced by a positive attitudinal 
relation when the communication falls 
within the individual’s latitude of ac- 
ceptance (s-digraphs a and b) and by 
a negative attitudinal relation when
the communication falls within the in- 
dividual’s latitude of rejection (s-di- 
graphs c and d). The results are con- 
sistent with these predicted attitudinal 
relations, assuming that a positive at- 
titudinal relation of receiver to com- 
munication may be reflected in a judg- 
ment that the communication is fair 
and unbiased, and that a corresponding 
negative attitudinal relation may be 
reflected in a judgment that the com-
munication is unfair and propagandis-
tic. The “assimilation” and “contrast”
effects obtained in this study indicate
that the receiver may misinterpret the
attitudinal relation of the communica-
tion to the issue when the communi-
cation is somewhat moderate or inter-
mediate in its stand. With respect to 
the present analysis the assimilation 
effect may be interpreted as a tendency 
for the receiver to misperceive the at- 
titudinal relation of communication to 
issue so that it is seen as closer to his 
own attitude about the issue than is 
actually the case, when the communi- 
cation falls within his latitude of ac- 
ceptance (s-digraphs a and b). The 
contrast effect may be interpreted as 
a tendency for the receiver to misper- 
ceive the attitudinal relation of com- 
munication to issue so that it is seen 
as more distant from his own attitude 
than is actually the case, when the 
communication falls within his latitude 
of rejection (s-digraphs c and d). It 
may also be possible to consider the 
assimilation effect as a manifestation 
of a positive unit relation between re- 
ceiver and communication (s-digraphs 
a and b) and the contrast effect as a 
manifestation of a negative unit re- 
lation between receiver and communi- 
cation (s-digraphs c and d). 

It will be noted that the above in- 
terpretation does not enable precise 
representation of the size of the dis- 
crepancy between the stand taken by 
the receiver of a communication and 
the position taken in the communica- 
tion. Discrepancy is considered as a 
difference in the sign of the attitudinal 
relations. A more refined analysis of 
this type of experiment would involve 
taking account of the differing 
strengths of the attitudinal relations 
as well as the differing signs. As we 
will later indicate, this is one im- 
portant direction in which the struc- 
tural balance model needs to be de- 
veloped. 


Studies with Induced Imbalance 


In this section we will first consider
a discussion by Hovland, Janis, and
Kelley (1953) concerned with the ef-
fects of varying source credibility.
This will be followed by a discussion
of Tannenbaum’s investigation of at-
titude change (Osgood & Tannen-
baum, 1955) and Festinger and Carl-
smith’s (1959) study of the effects
of presenting an attitude-discrepant
communication.

FIG. 6. Unbalanced s-digraphs (a and b)
and balanced s-digraphs (c and d) repre-
senting the case where attitudinal relations
of receiver and communication to issue are
at variance.

Hovland, Janis, and Kelley (1953). 
These authors are particularly con- 
cerned (pp. 40-46) with the situation 
in which a positive communicator pre- 
sents a communication which, on its 
own merits, would tend to be rejected. 
The two possible forms of this struc- 
ture are represented in s-digraphs a 
and b in Figure 6. 

These two s-digraphs represent 
situations where the source is respon- 
sible for a communication (positive 
unit relation), where the receiver is 
positively disposed towards the source 
(positive attitudinal relation), but 
where the attitudes of receiver and
communication towards the issue are 
at variance. These structures are both 
in a state of imbalance since not all 
the semicycles can be positive at the 
one time. For example, the structure 
corresponding to semicycle RC, CI, IR
in s-digraph a would be balanced if RC
were negative, since the semicycle 
would then be positive in sign. How- 
ever, a negative directed line RC would 
mean that the semicycle involving 
SR, RC, and the positive unit relation 
SC would be negative, i.e., the struc- 
ture corresponding to this semicycle 
would be out of balance. The discus- 
sion by Hovland, Janis, and Kelley 
(1953) can be seen as indicating how 
such imbalances are resolved. The 
possibilities they consider, which in- 
clude dissociating the source and the 
communication, reinterpreting the 
meaning or attitude of the communi-
cation, and a change in the receiver’s
attitude towards the source, can read- 
ily be interpreted using the structural 
balance principle. For example, the 
receiver might dissociate the source 
from the communication. This dis- 
sociation would be more likely to oc- 
cur the more ambiguous the evidence 
that the source is responsible for 
the communication. Dissociating the 
source from the communication may 
be represented by a negative unit re- 
lation between source and communi- 
cation. It is evident that, if the unit 
relations in s-digraphs a and b were 
negative rather than positive, these s-
digraphs would then be in balance.
This analysis also suggests that a re- 
ceiver probably expects that the atti- 
tudes of both source and communica- 
tion towards an issue should tend to 
be in harmony when the source is obvi- 
ously responsible for the communica- 
tion. Where, however, the receiver 
sees the attitudes of source and com- 
munication towards the issue as at 
variance, he may perceive the source 
as not really responsible for the com- 
munication. 
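
The argument about the two semicycles can be checked mechanically. The sketch below is illustrative only and rests on stated assumptions: relations are coded +1 or -1, a semicycle is positive when the product of its signs is positive, and only the two semicycles named in the text (R-C-I and S-R-C) are examined. The relation labels, and the choice of which of receiver and communication is favorable to the issue in s-digraph a, are hypothetical, since Figure 6 itself is not reproduced here.

```python
from math import prod

# The two semicycles discussed in the text, written as relation labels.
SEMICYCLES = (("RC", "CI", "RI"), ("RS", "RC", "SC"))


def is_balanced(relations: dict) -> bool:
    """True when every listed semicycle has a positive product of signs."""
    return all(prod(relations[name] for name in cycle) > 0 for cycle in SEMICYCLES)


def balancing_rc_values(relations: dict) -> list:
    """Signs of the receiver-to-communication relation that yield balance."""
    return [rc for rc in (1, -1) if is_balanced({**relations, "RC": rc})]


# Assumed coding of s-digraph a: credible source (RS), source responsible
# for the communication (SC), receiver and communication at variance on
# the issue (RI and CI of opposite sign).
digraph_a = {"RS": 1, "SC": 1, "RI": 1, "CI": -1}

# Dissociation: the unit relation between source and communication
# becomes negative.
dissociated = {**digraph_a, "SC": -1}

print(balancing_rc_values(digraph_a))
print(balancing_rc_values(dissociated))
```

With these assumptions no value of RC balances the original structure, whereas after dissociation a negative RC leaves both semicycles positive, in line with the resolution described above.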

Figure 6 also presents s-digraphs c
and d which represent communication 
structures in which complete balance 
is possible. In these two situations 
the attitudes of the receiver and the 
communication towards the issue are 
again at variance, but the receiver is 
negatively disposed towards the source. 
There are four other structures, corre-
sponding to those represented in Fig-
ure 6, in which the attitudes of re-
ceiver and communication towards the
issue are not at variance. These struc-
tures are balanced providing the re-
ceiver perceives the source as credible,
i.e., there is a positive attitudinal re-
lation of receiver to source. In Fig-
ure 7, s-digraphs a and b are unbal-
anced; s-digraphs c and d are bal-
anced.

FIG. 8. Balanced s-digraphs (c, d, and e) representing predicted
changes in relations in unbalanced s-digraph b according to the
principle of congruity.

Osgood and Tannenbaum (1955).
In his test of the congruity model
Tannenbaum first required subjects to
evaluate six cognitive objects consist-
ing of three sources (Labor Leaders,
Chicago Tribune, and Senator Robert
Taft) and three objects (Legalized
Gambling, Abstract Art, and Acceler-
ated College Programs). Communi-
cations consisting of “highly realistic
newspaper stories” were then con-
structed involving the following source-
concept pairs: Labor Leaders-Legal-
ized Gambling; Chicago Tribune-Ab-
stract Art; Senator Robert Taft-Ac-
celerated College Programs. Each
group of subjects read a favorable
newspaper story for one of these pairs,
an unfavorable story for a second pair,
and had no story at all for the third
pair. After reading the stories they
again evaluated the six sources and
concepts. The results showed that
changes in evaluations following ex-
posure to the communications were
generally consistent with predictions 
from the congruity model. For ex- 
ample, concepts linked to a highly 
valued source by a favorable assertion 
tended to be more highly valued. 

The procedure used in this study 
may be represented by using the s- 
digraphs in Figure 8. Let us sup- 
pose, for example, that in an experi- 
ment a subject gives a high positive 
evaluation to President Johnson (the 
source), and a high negative evalua- 
tion to Red China (the issue). This 
situation is represented in s-digraph a 
by a positive attitudinal relation of 
receiver to source and a negative at- 
titudinal relation of receiver to issue. 
The subject then reads a newspaper 
account stating that President John- 
son has made a speech (the commu- 
nication) praising Red China. This 
new information is represented in s- 
digraph b by a positive unit relation 
between source and communication
signifying that President Johnson is 
responsible for the speech, and a posi- 
tive attitudinal relation of communi- 
cation to issue (the speech is in favor 
of Red China). The structure in s- 
digraph b is out of balance. The re- 
maining s-digraphs c, d, e, and f in 
Figure 8 show how the structure would 
become balanced if one relation were 
to change. For example, s-digraph c 
would represent the balanced situation 
where the receiver is now in favor of 
Red China rather than against Red 
China. 
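
The way a single change of relation restores balance can be enumerated directly. The following sketch is offered only as an illustration under stated assumptions: the four relations described for s-digraph b are coded +1 or -1, they are taken to form one semicycle (receiver, source, communication, issue), and the structure is balanced when the product of the four signs is positive. The relation names are hypothetical labels.

```python
from math import prod

# Relations of s-digraph b as described in the text, coded +1 or -1.
DIGRAPH_B = {
    "receiver_to_source": 1,         # high evaluation of President Johnson
    "receiver_to_issue": -1,         # low evaluation of Red China
    "source_unit_communication": 1,  # Johnson is responsible for the speech
    "communication_to_issue": 1,     # the speech praises Red China
}


def is_balanced(relations: dict) -> bool:
    """Balanced when the single four-relation semicycle is positive."""
    return prod(relations.values()) > 0


def single_change_resolutions(relations: dict) -> list:
    """Relations whose change of sign, taken alone, balances the structure."""
    resolutions = []
    for name in relations:
        changed = {**relations, name: -relations[name]}
        if is_balanced(changed):
            resolutions.append(name)
    return resolutions


print(is_balanced(DIGRAPH_B))
print(single_change_resolutions(DIGRAPH_B))
```

Because the semicycle contains exactly one negative relation, reversing any one of the four signs makes it positive, which is consistent with the text's description of four alternative balanced s-digraphs (c, d, e, and f).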

Osgood and Tannenbaum’s model 
predicts that changes will occur in the 
receiver’s evaluations of President 
Johnson and Red China. President 
Johnson will be less positively valued, 
and Red China will be less negatively 
valued. These predicted changes would 
be based on the congruity formula. 
They would then be corrected to take 
account of the amount of “incredulity” 
involved in the situation, and for the 
fact that the assertion applies to the 
issue rather than to the source. With 
respect to the structural balance model 
this approach is equivalent to saying 
that a number of relations may change 
in the communication structure to pro- 
duce congruity. Typically, change 
would not be restricted to one relation 
only. Changes will tend to occur in the 
attitudinal relations of receiver to 
source and receiver to issue. These 
predicted changes would have to take 
into account the “assertion constant 
[Osgood & Tannenbaum, 1955, p. 
50]” and reactions of incredulity. 
Changes in the attitudinal relations of 
receiver to issue and receiver to source 
are represented in s-digraphs c and d 
in Figure 8. The reaction of in-
credulity may be represented in s-
digraph e by a negative unit relation
between source and communication.
For example, the receiver may assume
that President Johnson was not really
responsible for the speech but that 
he was induced by political pressure 
to make the statement. In allowing 
for multiple changes in relations the 
Osgood and Tannenbaum model in- 
volves a combination of the balancing 
reactions represented in s-digraphs c,
d, and e.

Although Osgood and Tannenbaum 
deal with differing evaluations of 
source and issue in their model, they 
do not provide for variation in the 
intensity of the assertion. “President 
Johnson lavishes praise upon Red 
China” would presumably be treated 
in the same way as “President John- 
son mildly praises Red China.” This 
difference could be conceptualized in 
the structural balance model as a dif- 
ference in the strength of the atti- 
tudinal relation of communication to 
issue although, as indicated previously, 
the present model has yet to be de- 
veloped to deal with differences in the 
strength of relations. It is interesting 
to note that s-digraph b would be bal- 
anced if the attitudinal relation of com- 
munication to issue were negative (as 
in s-digraph f). This type of effect 
is likely to occur in situations where 
the communication is sufficiently 
ambiguous to allow misinterpretation 
by the receiver. A much qualified 
statement by President Johnson which 
is, in fact, mildly favorable towards 
Red China may be interpreted as an- 
tagonistic to Red China by the re- 
ceiver. 

Finally, s-digraphs a and b in Fig-
ures 6 and 7 represent incongruous
situations in Osgood and Tannen-
baum’s sense. Since, according to our
assumptions, these s-digraphs are un-
balanced, incongruity here corresponds
to structural imbalance and both
models would predict that changes
would occur in the direction of con-
gruity or balance. The s-digraphs c
and d in Figures 6 and 7 are balanced
according to our assumptions. Even
here, however, Osgood and Tannen-
baum would expect changes to occur
in the attitudinal relations of receiver
to source and receiver to issue if these
relations were initially different in
strength. The structural balance
model at present makes no provision
for such changes. Their conceptuali-
zation would depend upon further de-
velopment of the structural balance
model to take account of relations
differing in intensity.7

7 The example discussed in this section
would be conceptualized as an unbalanced
+p- bond in Rosenberg and Abelson’s
(1960) balance model. Such a bond could
also be represented by the unbalanced s-di-
graph b in Figure 8. Rosenberg and Abel-
son’s model focuses upon the signs of rela-
tions while Osgood and Tannenbaum’s model
also considers the strength of relations.
Brown (1962) notes a number of other im-
portant differences between the two models.

FIG. 9. Unbalanced s-digraphs (a and d) and corresponding bal-
anced s-digraphs (b, c, and e, f) showing changes in attitudinal and
unit relations.

Festinger and Carlsmith (1959).
In this experiment subjects were re-
quired to perform a boring and tedious
task and then to tell the “next sub- 
ject” that the task was interesting and 
enjoyable. Some subjects were of-
fered $1.00 to engage in this attitude-
discrepant behavior, others were of- 
fered $20.00. After they had made 
the discrepant statements, subjects 
rated how interesting and enjoyable 
they found the experiment. A control 
group of subjects, not required to 
make attitude-discrepant statements,
also rated the experiment for how 
enjoyable it was. Festinger and Carl- 
smith found that the control group 
tended to rate the experiment as un- 
pleasant. The mean rating from sub- 
jects in the $1.00 experimental con- 
dition was significantly higher than 
that of the control group, and defi- 
nitely on the positive side of the scale. 
The mean rating from subjects in the 
$20.00 experimental condition was 
only slightly above the mean for the 
control condition, and was significantly 
below the mean for the $1.00 experi-
mental condition. Festinger and Carl- 
smith interpret this result in terms of 
dissonance theory, but an alternative 
interpretation in terms of the struc- 
tural balance model is possible. 

The communication situation investi- 
gated by Festinger and Carlsmith cor- 
responds to s-digraph a in Figure 9, 
where the source initially has a nega- 
tive attitudinal relation towards the 
issue (in this case the experiment), 
and the communication he presents has 
a positive attitudinal relation towards 
the issue. The attitudinal relation of 
source to receiver is not specified in 
the experiment. We assume that 
initially the source would evaluate the 
communication negatively. However, 
the $1.00 experimental condition is as- 
sumed to have the effect of imposing 
a positive unit relation of responsi- 
bility between source and communica- 
tion which is resistant to change since 
the positive inducement to engage in
the discrepant behavior is low.8 The
structure represented in s-digraph a is
unbalanced. We assume that there is a
tendency for the attitudinal relations
to be modified in the direction of those
involved in the balanced structure
represented in s-digraph b, i.e., a posi-
tive attitudinal relation towards the
experiment and a positive evaluation
of the communication. This type of
modification is less likely in the $20.00
experimental condition where the sub-
ject may feel less responsible for his
action. Here the positive unit rela-
tion is likely to be fairly weak since
the inducement to send the attitude-
discrepant communication is high.
Hence the source can view his action
as an outcome of the high financial
reward offered rather than as express-
ing his own desire to communicate
the position advocated. The positive
unit relation is therefore more easily
changed in sign. The source can dis-
own the communication and maintain
his negative attitude towards the ex-
periment (s-digraph c). Figure 9 also
presents a second type of unbalanced
structure (s-digraph d) and the corre-
sponding balanced structures which
are likely to develop if the positive
unit relation is very strong (s-digraph
e) or very weak (s-digraph f). The
above analysis assumes that the po-
sition taken by the communication is
quite unambiguous and cannot be mis-
interpreted by the source.

8 It is likely that some subjects would
disown responsibility for the attitude-dis-
crepant behavior even in the $1.00 experi-
mental condition. They could decide, for
example, that they actually had little choice
in the matter since the experimenter would
expect their cooperation. They therefore
acted for him rather than of their own
volition. It would in fact be possible to
incorporate the experimenter as another ele-
ment within the structural balance model
and to represent relations between the ex-
perimenter and S and the experimenter and
C (e.g., the experimenter is seen by S as
a trustworthy authority who is obviously in
favor of C). In some cases S may justify
attitude-discrepant behavior because a posi-
tively regarded experimenter is in favor of
the behavior. This might influence S’s ini-
tial attitude towards the behavior as well as
his perceived responsibility for the behavior.

The preceding discussion suggests that one may
interpret many of the dissonance studies
involving attitude-discrepant behavior in terms
of the degree to which the person considers
himself responsible for the behavior. Brehm and
Cohen (1962), for example, summarize studies
which imply that the magnitude of dissonance
arousal tends to be greater the less the
positive inducement for commitment to discrepant
behavior, the greater the choice in commitment
to discrepant behavior, and the less the
coercion applied in order to induce discrepant
commitment. We believe that each of these
experimental operations may influence the degree
to which the person believes that he is
personally responsible for his actions.
Conditions of high positive induce- 
ment, low choice, and high coercion 
would function to reduce the tendency 
for the person to feel that he is re- 
sponsible for his attitude-discrepant 
behavior. These are factors intro- 
duced by others into the situation. A 
person can appeal to these factors to 
justify behavior which is discrepant 
with his attitude. Where positive in- 
ducement is low, degree of choice is 
high, and coercion is minimized how- 
ever, the person is less able to justify 
attitude-discrepant behavior in terms 
of external constraints. Under these 
conditions he is more responsible for 
his behavior. Brehm and Cohen 
(1962) express a similar idea when, 
in discussing experimental evidence, 
they conclude that, “The greater the
degree of volition involved in the occurrence
of dissonant cognitions, the greater the
magnitude of the dissonance [p. 217].”
The above analysis of
Festinger and Carlsmith’s experiment 
suggests that within the structural bal- 
ance model “volition” or responsibility 
may be conceptualized by a positive 
unit relation. 


CONCLUSIONS 


The preceding discussion of experi- 
mental studies shows that the struc- 
tural balance model has considerable 
power in providing the possibility of 
a general conceptualization of com- 
munication effects. We have shown 
that one may interpret studies based 
on different theoretical approaches in 
terms of the one general model. 

In using this model we map into 
the cognitive space of either the source 
or the receiver of the communication. 
If, for example, we are interested in
the receiver’s reactions to a communication
then the relations in the communication
structure would be considered from his
point of view.
Similarly, if we are interested in the 
source’s reactions to a communication 
then the communication structure 
would be in terms of his perception of 
the different relations. As Osgood 
(1960) indicates, 

Cognitive interactions, obviously, transpire 
within the nervous systems of single individuals.
The ‘maps’ we draw to represent
such interactions necessarily reflect person- 
person and person-object relations as some 
individual perceives them [p. 346]. 

The studies discussed above imply 
the feasibility of varying relations 
within this cognitive structure by ex- 
perimental operations. It is then pos- 
sible to observe behavior which is as- 
sumed to reflect other relations within 
this cognitive structure, or changed relations
in accordance with the balance
principle. For example, suppose that 
in an experiment a speaker who was 
introduced as a world authority on a 
particular issue then delivered a speech 
deliberately intended to be at variance 
with the subject’s known attitude to- 
wards the issue. We assume that this 
experimental manipulation would initi- 
ally determine an unbalanced cognitive 
structure such that the receiver would 
be positively disposed towards the 
source, he would perceive the source 
as responsible for the communication, 
and he would see the attitude of the 
communication towards the issue as 
different from his own. We would 
predict that this unbalanced cognitive 
structure would tend to become bal- 
anced, and that changes in the rela- 
tions within the structure would be 
reflected in the receiver’s reactions to 
the communication. The predictions 
in this experiment would therefore de- 
pend upon our assumptions about the 
cognitive structure of the receiver since 
we are interested in his reactions. It 
should be obvious that in any given
communication situation there may be
important differences between the cognitive
structures of the source and the
receiver in that they may not perceive 
relations within the communication 
situation in the same way. 

There are some important ways in 
which the structural balance model 
may be developed further. They are 
as follows: 


1. Further development of the 
model could consider relations which 
differ in strength as well as in sign. 
At present, the model considers only 
relations which are positive or nega- 
tive. With this development it would 
be possible to deal more adequately 
with investigations involving variation 
in the discrepancy between the sub- 
ject’s stand on an issue and the po- 
sition advocated in a communication 
about the issue (Hovland, Harvey, & 
Sherif, 1957; Sherif & Hovland, 
1961). One could also provide a more 
detailed analysis of research findings 
based upon both the congruity model 
(Osgood & Tannenbaum, 1955) and 
dissonance theory (Brehm & Cohen, 
1962; Festinger, 1957). 

Cartwright and Harary (1956) re- 
fer to the development of a systematic 
treatment of relations of varying 
strength as one of the principal un- 
solved problems of linear graph theory. 
They believe that it is possible to deal 
with the strength of relations by the 
concept of a graph of strength σ suggested
by Harary and Norman (1958).
Morrissette (1958) assumes that the 
strength of a relation may vary con- 
tinuously from +1 through 0 to —1. 
He defines the strength of a cycle as 
the product of the strength of its lines, 
and the sign of a cycle as positive if 
the product of the strength of its lines 
is positive, and negative otherwise. 
Using these concepts he develops a 
formula for the degree of total balance
of a signed graph. The problem goes
beyond defining the strength of a cycle
or semicycle, however, in that there 
are many other kinds of questions 
which one would seek to answer. For 
example, if the attitudinal relation of 
receiver to issue is strong (e.g., .9) and 
the attitudinal relation of communica- 
tion to issue is moderate (e.g., .5), 
what will be the strength of the attitudinal
relation of receiver to communication?⁹
Following Osgood and Tannenbaum (1955),
might there be some
readjustment in strengths of relations 
within a semicycle according to a con- 
gruity formula? Given semicycles of 
equal length, is a weak negative semi- 
cycle less likely to change than a strong 
negative semicycle? Given relations 
of the same strength (e.g., .7), are
large unbalanced semicycles less likely 
to change than short unbalanced semi- 
cycles? These kinds of questions can 
only be answered by further elabora- 
tion of the model combined with em- 
pirical tests of predictions. 

2. The model could also be devel- 
oped to take account of differences in 
the importance of the issue for either 
the source or the receiver. Dissonance 
theory (Festinger, 1957) allows for 
the importance of the cognitive ele- 
ments involved in a dissonant relation- 
ship (although just how one would 
determine their “weighting” is rather 
unspecified), and there have been
some recent studies concerned with
degree of involvement (e.g., Zimbardo,
1960). Sherif and Hovland (1961)
assume that a strong stand on an
issue which is personally involving for 
the individual functions as an anchor 
in his judgments so that he is less 


9 Where the former two relations are of
the same sign and both are strong it seems
likely that the balanced third relation would
be strong. Conversely, where the former
two relations are of the same sign and both
are weak it seems likely that the balanced
third relation would be weak.
likely to be influenced by highly discrepant
persuasive communications.
This type of variable implies resistance 
to change in the attitudinal relation 
of person to issue. It may, however, 
be an oversimplification to equate the 
importance of the issue with the 
strength of the person's attitude to- 
wards the issue since attitudes may 
presumably be of equivalent strength 
yet differ in their resistance to change. 
For example, one might feel strongly 
about jazz music and strongly about 
scientific integrity but the second atti- 
tude, it is hoped, would be less amen- 
able to change than the first. The in- 
tensity or strength of an attitude is 
probably only one factor governing its 
resistance to change. Katz (1960) 
suggests that other important factors 
are the specificity or generality of the 
attitude, its degree of differentiation, 
the number and strength of its link- 
ages to a related value system, and 
the centrality of the attitude. The 
latter factor refers to the role of the 
attitude “as part of a value system 
which is closely related to the indi- 
vidual’s self-concept [Katz, 1960, p. 
169].” If these different factors are 
to be taken into account it is appar- 
ent that the conceptualization of the 
importance of the issue in terms of 
resistance to change in an attitudinal 
relation presents a formidable task.¹⁰

3. One aspect of the structural bal- 
ance model that requires some clarifi- 
cation concerns the specific effects of 
a state of imbalance within a structure. 
The relations within a structure are 
assumed to be modified so that the 
structure becomes balanced. But 


10 One possible approach would be to consider
the issue as one of a set of issues,
closely bound by unit relations, and towards 
which the receiver has strong attitudinal 
relations. In this case a change in any one 
attitudinal relation might produce imbalance 
in the whole system and would be resisted. 


Fig. 10. Unbalanced s-digraphs (a and b) differing in strength of
attitudinal relation of receiver to source.


which relations will be modified? In 
representing the outcomes of some of 
the experimental studies considered in 
the previous section we assumed that 
the weakest relation in an unbalanced 
structure will tend to change. In 
Figure 10, for example, by this as- 
sumption the attitudinal relation of re- 
ceiver to issue represented in s-di- 
graph a would tend to change to posi- 
tive since it is the weakest relation, 
but in s-digraph b the weakest rela- 
tion represented is the attitudinal re- 
lation of receiver to source and it 
would tend to change to negative. 
Such an analysis would lead one to 
expect that when a highly credible 
source is seen by the receiver as obvi- 
ously responsible for a communica- 
tion having a strong attitude towards 
an issue which is at variance with the 
receiver's attitude towards the issue, 
then the receiver's attitude may 
change. Where, under similar circum- 
stances, the source is only moderately 
credible, the receiver may maintain 
his attitude towards the issue and 
disparage the source. Results of a 
recent study by Aronson, Turner, and 
Carlsmith (1963) tend to be con- 
sistent with this prediction. 

There are, however, other possible
assumptions. Following Osgood and 
Tannenbaum (1955), one might as- 
sume that multiple changes will tend 




to occur within the structure. Alter- 
natively, one might assume that bal- 
ance will be restored by a minimal 
number of changes in the structure. 
Rosenberg and Abelson (1960), in their
presentation of a balance model, discuss
evidence which is consistent with
this assumption. While the “weakest 
relation” and the “minimal change” 
assumptions may lead to similar pre- 
dictions for some communication situ- 
ations, it is possible that there are 
cases where predictions would differ. 
It is obvious, however, that different 
assumptions about specific effects 
within unbalanced communication 
structures need to be explored experi- 
mentally. Dissonance theory (Brehm 
& Cohen, 1962; Festinger, 1957) 
seems to avoid this issue and its re- 
lated experiments generally attempt 
to block off all possible avenues of 
dissonance reduction except the one 
being studied. 

4. Finally, the structural balance 
model needs to take account of indi- 
vidual differences in the degree to 
which persons are able to tolerate in- 
consistencies. One might do this by 
using the concept of a threshold for 
intolerance of inconsistency and by 
specifying the different factors which 
affect the height of this threshold (cf. 
Hovland & Rosenberg, 1960). It is 
likely that this threshold is a function 
of both personality and situational 
characteristics. In a recent study of 
personality variables (Feather, 1964) 
we investigated the effect of differ- 
ences in attitude strength, critical abil- 
ity, and intolerance of ambiguity on 
the tendency to evaluate logical argu- 
ments in a manner consistent with at- 
titude. There is, however, a dearth 
of studies concerned with individual 
differences in reactions to imbalance, 
incongruity, etc., and there is an im- 
portant need for research in this rela- 
tively unexplored area. 
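
Two of the points above lend themselves to a small worked sketch: the definition attributed to Morrissette (1958) under point 1, in which the strength of a cycle is the product of the strengths of its lines and its sign is the sign of that product, and the "weakest relation" assumption discussed under point 3. The Python below is only an illustration under stated assumptions. The function names, the dictionary keys, and the numerical strengths are hypothetical and are not taken from the paper; the two example structures are merely meant to resemble the situations described for s-digraphs a and b of Figure 10.

```python
from math import prod


def cycle_strength(strengths):
    """Strength of a cycle: the product of the strengths of its lines."""
    return prod(strengths)


def cycle_sign(strengths):
    """+1 if the product of the line strengths is positive, otherwise -1."""
    return 1 if cycle_strength(strengths) > 0 else -1


def weakest_relation(relations):
    """Name of the relation with the smallest absolute strength."""
    return min(relations, key=lambda name: abs(relations[name]))


def rebalance_by_weakest(relations):
    """Apply the 'weakest relation' assumption: if the cycle is unbalanced,
    reverse the sign of the weakest relation; otherwise leave it unchanged."""
    updated = dict(relations)
    if cycle_sign(updated.values()) < 0:
        name = weakest_relation(updated)
        updated[name] = -updated[name]
    return updated


# Hypothetical strengths (assumed values, for illustration only).
DIGRAPH_A = {
    "receiver_to_source": 0.9,
    "source_unit_communication": 0.9,
    "communication_to_issue": 0.8,
    "receiver_to_issue": -0.3,   # weakest relation
}
DIGRAPH_B = {
    "receiver_to_source": 0.3,   # weakest relation
    "source_unit_communication": 0.9,
    "communication_to_issue": 0.8,
    "receiver_to_issue": -0.8,
}

if __name__ == "__main__":
    for label, graph in (("a", DIGRAPH_A), ("b", DIGRAPH_B)):
        print(label, weakest_relation(graph), rebalance_by_weakest(graph))
```

In the first structure the receiver's attitude towards the issue is reversed, and in the second the receiver's attitude towards the source is reversed, which is the pattern the weakest-relation assumption predicts. A "minimal change" rule would need a different function and could give different results for larger structures.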
It will be evident that the
developments in the structural balance
model listed above are also relevant
to most of the other consistency
models. These models face similar
issues. We believe that these issues 
can be resolved in the structural bal- 
ance model by additional theoretical 
assumptions whose implications are 
checked empirically. This would lead 
to a more complex mathematical model 
with considerable scope. Even now, 
however, the structural analysis of 
communication situations outlined in 
this paper provides something of a
unifying approach to the analysis of 
communication effects. It suggests 
also that there are many unbalanced 
communication structures awaiting in- 
vestigation, since Figure 3 implies that 
the number of balanced communica- 
tion structures is quite limited. 

The approach discussed in the pres- 
ent paper is at a “molar” level. It 
specifies the conditions for balanced 
structures but does not attempt to explain
just why these are the conditions.
Some suggestions about underlying 
mechanisms have already been made 
in the literature (cf. Hovland & 
Rosenberg, 1960, pp. 223-227), but 
much more needs to be done to account
for the origins and development
of preferences for balanced and con- 
sistent structures. The present analy- 
sis of structural relations based on the 
theory of linear graphs does, however, 
provide a unifying and testable model 
with many interesting theoretical and 
research implications. 


11 A particular individual's reaction to imbalance
would depend upon the resistance
to change of relations within his cognitive 
structure. An individual may also be more 
intolerant of imbalance in some situations 
than in others depending, for example, on 
whether or not his evaluation of the commu- 
nication is publicly expressed (cf. Hovland 
& Rosenberg, 1960). 
NOISE AND WEBER'S LAW:


THE DISCRIMINATION OF BRIGHTNESS 
AND OTHER DIMENSIONS 


MICHEL TREISMAN¹
Institute of Experimental Psychology, Oxford 


The model of the threshold given by signal detection theory is used as 
a basis for explaining the form of the Weber function. For this purpose 
three sources of noise are considered: spontaneous background noise, 
the irreducible physical variability of the stimulus, and variation in 
the response of the neural pathways simultaneously excited by the ad- 
ministration of a stimulus. It is postulated that the last will show posi- 
tive correlation, i.e., that the central effects of the different elements 
which are simultaneously activated by a stimulus will vary from trial 
to trial in a similar, rather than in independent fashion. If this is 
accepted, a law for the Weber function can be derived which gives, for 
visual brightness discrimination, a square root relation at low stimulus 
intensities with a continuous transition to a linear law at high intensities, 
and which can be applied to explain some features of visual spatial and 
temporal summation. The model also allows Weber's law to be derived
for other sensory dimensions.


Weber's law states that $\Delta I/I = k$,
where $k$ is a constant, $I$ is a given
stimulus intensity, and $\Delta I$ is the
just noticeable difference in that intensity,
according to some criterion
(e.g., that it is detected on 50% of
trials when it is added to $I$). But
the Weber functions (the relations
between $\Delta I$ and $I$) which are
determined experimentally for the different
modalities are not well described
by Weber’s law. The law holds, to
an approximation, for the midranges
of many stimulus dimensions, but for
low values of $I$, and sometimes for
high values, the Weber fraction, $\Delta I/I$,
tends to increase. The linear generalization
of Weber's law, given by
adding a constant, $I_r$, to $I$, usually
permitting me to reproduce Figure 2. My
appreciation is due to R. C. Oldfield for the 
provision of research facilities. 


gives a better fit, as was known to 
Fechner (1860); and Guilford (1932) 
has suggested that the power of I 
should be assumed to vary between .5 
and 1. Thus a general expression for 
the Weber function might be 


$$\Delta I = k(I + I_r)^p \qquad [1]$$


where $k$, $p$, and $I_r$ are constants.
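
As a small numerical illustration of the general expression, the sketch below assumes it reads ΔI = k(I + I_r)^p, with the additive constant written `i_r`. The function names and the parameter values are hypothetical and are chosen only to show how an additive constant makes the Weber fraction ΔI/I rise at low intensities while leaving it nearly constant at high ones.

```python
def generalized_weber(intensity, k, i_r, p):
    """Just noticeable difference under the assumed form dI = k * (I + i_r) ** p."""
    return k * (intensity + i_r) ** p


def weber_fraction(intensity, k, i_r, p):
    """Weber fraction dI / I for a given background intensity (intensity > 0)."""
    return generalized_weber(intensity, k, i_r, p) / intensity


if __name__ == "__main__":
    # Assumed illustrative constants: k = 0.1, i_r = 1.0, p = 1 (linear generalization).
    for intensity in (1.0, 10.0, 100.0, 1000.0):
        print(intensity, round(weber_fraction(intensity, 0.1, 1.0, 1.0), 4))
```

With `i_r = 0` and `p = 1` the function reduces to the classic form of Weber's law, in which the fraction equals `k` at every intensity.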

The Weber function helps to define 
the input-output relations of human 
subjects performing discrimination 
tasks. It limits the logical space 
within which possible mechanisms for 
discrimination may be postulated; 
any such model must give this func- 
tion as its output. This paper will 
attempt to show that the model 
based on signal detection theory can, 
with additional assumptions, adequately
account for the main observed
features of Weber functions.

The combination of statistical decision
theory (Neyman & Pearson, 1933) and the
hypothesis that successive presentations
of a single stimulus
produce a dispersion of central neural
effects (Thurstone, 1927a) led to the 
development of signal detection theory 
(Barlow, 1956; de Vries, 1943; Greg- 
ory, 1956; Rose, 1948; Swets, Tanner, 
& Birdsall, 1961; Tanner & Swets, 
1954). Aspects of the theory relevant 
to the arguments to be presented will 
be shortly described. (See Figure 1). 
We assume that when stimuli varying
on a physical dimension $I$ are presented
to a subject they produce
central neural effects whose magnitude
is described by $E = f(I)$. The
dimension $E$ serves as the “decision
axis”; each stimulus presentation
produces a magnitude of $E$, and a
decision rule is applied to this magnitude
to determine which response the
subject will make. When a stimulus
$I'$ is repeatedly presented it produces a
distribution of central effects with
mean $E'$ and variance $\sigma_{E'}^2$. In the
simplest case it is assumed that this
distribution is approximately normal,
and that for stimuli $I'$, $I''$, $I'''$, lying
in the small range used to determine
a single threshold,
$\sigma_{E'} = \sigma_{E''} = \sigma_{E'''} = \sigma_E$.
If on each trial of an experiment
the subject must decide whether
or not the stimulus presented was
greater than $I'$, the decision rule
would be that he must compare each
central effect, $E$, with a criterion, $c_E$;
if $E$ exceeds this value the response is
“Yes,” if not, “No.” For trials on
which a stimulus $I'' > I'$ is presented
the probability of the response “Yes,”
$P(Y|I'')$, would then be given by the
integral of the $E''$ distribution for
$E > c_E$; when $I'$ is presented the
corresponding integral of the $E'$ distribution
would give the false positive
rate, $P(Y|I')$.

There are a number of alternative
methods for computing the optimal
criterion, $c_E$ (Birdsall, 1955); however,
the criterion proposed by Neyman
and Pearson (1933) has the
advantage that it is applicable to the
common psychophysical procedures,
and to everyday situations (Treisman,
1964a). It requires that the
false positive rate must not exceed a
limiting value, and is satisfied if
$c_E = E' + z\sigma_E$, where $z$ is a constant
such that $P(Y|I')$ equals the limiting
acceptable false positive rate. This
does not require that the subject
know the form of the $E''$ distribution,
or the a priori probability of $I''$.
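
The decision rule just described can be sketched numerically. The Python below is a minimal illustration that assumes normally distributed central effects with a common standard deviation, as in the simplest case of the model. The function names and the numerical values are hypothetical; `NormalDist` is from the Python standard library.

```python
from statistics import NormalDist


def criterion(mean_noise, sigma_e, max_false_positive):
    """Criterion c_E = E' + z * sigma_E, with z chosen so that the false
    positive rate P(Y | I') equals the limiting acceptable value."""
    z = NormalDist().inv_cdf(1.0 - max_false_positive)
    return mean_noise + z * sigma_e


def p_yes(mean_effect, sigma_e, c_e):
    """Probability that a central effect drawn from a normal distribution
    with the given mean and standard deviation exceeds the criterion."""
    return 1.0 - NormalDist(mean_effect, sigma_e).cdf(c_e)


if __name__ == "__main__":
    # Assumed values: E' = 10, sigma_E = 2, limiting false positive rate .05.
    c_e = criterion(10.0, 2.0, 0.05)
    print("criterion:", round(c_e, 3))
    print("false positive rate:", round(p_yes(10.0, 2.0, c_e), 3))
    print("P(Y) when the mean effect equals the criterion:", p_yes(c_e, 2.0, c_e))
```

A stimulus whose mean central effect equals the criterion is reported on half of the trials, which is the sense in which the 50% threshold corresponds to an increment of z standard deviations on the decision axis.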

Figure 1 shows the correspondence
between the distributions on $E$ and
the psychometric function. We can
consider the $E'$ distribution as a
“noise” distribution and the $E''$ distribution
as due to “signal + noise.”
The figure applies to the determination
of the incremental threshold or
of the absolute threshold by the
method of constant stimuli. In the
latter case $I' = 0$, in the former the
variable stimulus increments
are superimposed on a constant background
intensity, $I' > I_0$, where $I_0$ is
the absolute threshold. In each case
the threshold criterion, $P(Y) = .5$
(where $P(Y)$ is the probability of the
response “Yes”), will be satisfied by
the stimulus $I'' = I' + \Delta I$, for which
$E'' = c_E = f(I'')$. When $I' = 0$, $I''$
is the absolute threshold; when $I'$ is
supraliminal $\Delta I$ is the just noticeable
difference, and corresponds to $z\sigma_E$.
The standard deviation of the psychometric
function will correspond
approximately to $I''' - I''$, where $I'''$
is the stimulus for which $P(Y) = .84$;
the central effect of $I'''$ is
$E''' = E'' + \sigma_E$.

Another procedure sometimes used
to measure the just noticeable difference
is to present the standard
stimulus $I'$ and a variable stimulus
$(I' \pm \Delta I)$, requiring the subject to
report whether the second is greater
or less than the first. In this
case the psychometric function for
“Greater” responses will have its
50% point at $I'$ (ignoring any constant
error) and the standard deviation
of this psychometric function
may be taken as the differential

Fig. 1. The signal detection model for the measurement of the
incremental threshold by the method of constant stimuli. The presentation
of a stimulus $I$ produces a dispersion of central effects on the dimension $E$.
This dimension serves as the “decision axis” used to determine the responses
when threshold discriminations are made. The psychometric function for
$P(Y)$, the probability of the response “Yes,” is shown. The probability of the
response “Yes,” $P(Y|I')$, corresponding to $I'$, the standard stimulus, is the false
positive rate. $I''$ is the stimulus giving $P(Y) = .50$; $I'' - I' = \Delta I$. $I'''$ exceeds
$I''$ by the standard deviation of the psychometric function. With
these procedures, that standard deviation is identical with the standard
deviation of $\Delta I$.


threshold (this corresponds to an 84%
response criterion). If on each occasion
the decision device compares
the central effect of $(I' + \Delta I)$ with
$E'$, then the standard deviation of the
psychometric function will again correspond
to $\sigma_E$. But if the procedure
is such that the subject cannot identify
the standard stimulus as such on
each trial, and he must operate by
comparing the particular central
effects produced by each of the two
stimuli on each trial, then, if these are
uncorrelated, the jnd will correspond
to $\sqrt{2}\,\sigma_E$ (Thurstone, 1927b). Thus
with each method the jnd will be a
function of $\sigma_E$. The value of $\Delta I$
given by a procedure of this sort is
sometimes referred to as the “differential
threshold.” However, in what
follows we shall consider mainly the
incremental threshold.
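
The factor of the square root of two in the two-stimulus comparison follows from adding the variances of two uncorrelated central effects. The sketch below checks this with a seeded simulation. It assumes normally distributed effects with a common standard deviation; the function names, the number of trials, and the seed are hypothetical choices made for illustration.

```python
import random
from math import sqrt
from statistics import stdev


def difference_sd(sigma_e):
    """Standard deviation of the difference between two uncorrelated
    central effects that each have standard deviation sigma_e."""
    return sqrt(2.0) * sigma_e


def simulated_difference_sd(sigma_e, trials=20000, seed=0):
    """Monte Carlo estimate of the same quantity from independent normal draws."""
    rng = random.Random(seed)
    diffs = [rng.gauss(0.0, sigma_e) - rng.gauss(0.0, sigma_e)
             for _ in range(trials)]
    return stdev(diffs)


if __name__ == "__main__":
    print("analytic: ", difference_sd(1.0))
    print("simulated:", simulated_difference_sd(1.0))
```

If the two central effects were positively correlated across the pair, the variance of their difference would be smaller than this value, which is why the lack of correlation is stated as a condition.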


Weber Function 


The signal detection model for 
threshold discrimination has now 
been presented, and we must ex- 
amine whether it can be used to 
explain the Weber function. It follows
from the model that the relation
between $\Delta I$ and $I$ is to be understood
in terms of the functions relating
$E$ to $I$, and $\sigma_E$ to $E$. Thus the sources
of the noise which produces the dis- 
persion of the central effects of a 
nominally constant stimulus, and 
contributes to its mean effect, must 
be analysed. Three sources of noise 
can be distinguished; in some respects 
they can be best differentiated in rela- 
tion to visual intensity discrimination, 
and this modality will be mainly 
discussed here. 

The importance of spontaneous
activity in the sensory system in the
absence of a stimulus was first recognized
by Fechner (1860) who attributed
departures from Weber's
law at low visual intensities to the
“Augenschwarz.” Kuffler, FitzHugh,
and Barlow (1957) observed maintained
discharges in the ganglion cells
of the cat in complete darkness, and
they suggested these might arise from
the decomposition of visual pigment,
excitation of fine dendritic terminals
by random electrical noise, or in
other ways. Noise due to spontaneous
firing, or to the incursion into
the visual pathways of impulses 
originating outside them independ- 
ently of the stimulus, might also arise 
at various levels of the afferent 
system. For simplicity, however, 
this background noise will be treated, 
following Barlow (1957a), as though 
it is caused mainly by thermal de- 
composition of molecules of visual 
pigment. The spontaneous background
activity is equivalent in effect
to a stimulus of small intensity which
is constantly present, or “dark light”;
it explains the existence of an absolute
threshold, and is represented by the
constant $I_r$ in Equation 1. Thus it
accounts for the improved fit when 
the linear generalization of Weber's 
law is used instead of the classic form. 
The background activity will mainly 
affect the mean of the central distri- 
bution corresponding to a given 
stimulus; it does not account for the 
constants k and p. 

A second important source of noise 
is the irreducible physical variability 
of a stimulus of constant nominal 
intensity. With light, this is the 
variation in the number of quanta 
absorbed from a flash of constant 
intensity (or considered to be ab- 
sorbed from the dark light), which is 
described by the Poisson distribution. 
Rose (1948) compared the eye to an 
ideal device whose performance is 
limited only by fluctuations in the 
absorption of quanta, and to this 
Barlow (1958a) added the assumption 
of a dark current. Thus if the mean
number of quanta, $m$, effectively
absorbed from a flash of constant
intensity, $I$, is given by $m = gI$,
where $g$ is a constant, $I_r$ is the dark
light, and detection is limited only
by the variance of the number of
quanta absorbed, then the incremental
threshold, $\Delta I$, will be $z g^{-1/2}(I + I_r)^{1/2}$.
Therefore the model based on these
two noise sources predicts a power
function with the exponent $p = .5$.
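
The square root prediction can be illustrated with a short sketch. It assumes that the quantum catch is Poisson distributed with mean g(I + I_r), so that its standard deviation is the square root of that mean, and that threshold is reached when the mean extra catch, gΔI, equals z standard deviations. The function name and the numerical values are hypothetical.

```python
from math import sqrt


def quantum_limited_threshold(intensity, dark_light, g, z):
    """Increment dI satisfying g * dI = z * sqrt(g * (I + I_r)),
    that is, dI = z * sqrt((I + I_r) / g)."""
    return z * sqrt((intensity + dark_light) / g)


if __name__ == "__main__":
    # Assumed values: dark light 1.0, g = 0.5, z = 2.0.
    for intensity in (3.0, 15.0, 63.0):
        print(intensity, round(quantum_limited_threshold(intensity, 1.0, 0.5, 2.0), 3))
```

Each fourfold increase in (I + I_r) doubles the threshold, which is the square root relation with exponent .5.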

Agreement with the square root 
law has been found for stimuli of 
small area and short duration on back- 
grounds of low intensity for rod vision 
(Barlow, 1957a) and, under similar 
conditions, for cone vision (Barlow, 
1958a). But under other conditions, 
“the experimental points deviate from 
the appropriate theoretical curve and 
tend to obey the Weber law instead
[Barlow, 1958b].” Since most studies
have found good approximations
to the linear generalization of Weber's 
law for the greater part of the range 
of stimulation (Aguilar & Stiles, 
1954; Blackwell, 1946; Steinhardt, 
1936), we face the problem “can
reasons be found for the departures
from the theoretical predictions which
occur when the test stimulus is of
long duration or large area, and when
the background intensity is high
[Barlow, 1957a]?”

An approximately linear Weber 
function is so ubiquitous in studies 
of sensory discrimination that any 
explanation for it for visual intensity 
should be capable of generalizing to at 
least some other modalities. Since, 
on the signal detection model, noise 
will limit threshold discrimination on 
all dimensions, and the two sources 
of noise considered above only partly 
explain the form of the Weber func- 
tion, it is prima facie plausible that 
the occurrence of linear Weber func- 
tions may be due to a feature of the 
third source of noise which is not 
specific to vision alone. This source
is the “neural noise” which may arise
from variation in the responsiveness
of the afferent paths to the sensory
inputs resulting from the absorption
of quanta from the stimulus (or from
the dark light). Since little is known
about this noise, assumptions must 
be made if predictions are to be de- 
rived from it. Two hypotheses will 
be presented and their consequences 
discussed; evidence for them will be 
considered later. They are: (a) The
effective absorption of a quantum of
light (or a noise event equivalent in
its effects to the breakdown of a
molecule of visual pigment) will produce
a sensory message of mean magnitude
$s$ and with variance $\sigma_s^2$. If
$m$ quanta are effectively absorbed
from a light flash, and $n$ equivalent
noise events occur, the central effect
of that flash will be given by
$E = \sum_{i=1}^{m+n} s_i$.
For flashes of constant nominal intensity
$I$ the mean central effect
will be


$$E = sg(I + I_r) \qquad [2]$$


This hypothesis is discussed and 
modified below. (b) One way in 
which the afferent visual system may 
be conceived is as a set of parallel 
afferent paths, each of which may 
serve to convey the sensory message 
resulting from the absorption of a
single quantum of light. The total 
central effect resulting from a given 
stimulus administration is then the 
sum of the messages carried by all the 
paths activated by that stimulus. 
The responsiveness of each path
varies from trial to trial, so that the
central effect produced by the absorption
of a quantum by a given receptor
on different trials will vary; this variation
in responsiveness is the source of
$\sigma_s^2$. Furthermore, the different paths
do not vary independently in their
responsiveness, but in a related way,
so that on one trial the responsiveness
of all paths may be greater than usual,
on another trial less. These assumptions
about the afferent systems are
quoted to illustrate a possible physiological
basis, not necessarily the only
one, for the second hypothesis: that
the product-moment correlations $r_{ij}$
between $s_i$ and $s_j$, where these are
any two of the sensory messages
produced by the absorption of quanta
from a given light flash (or by equivalent
noise events), will be positive;
we shall suppose them to approximate
to unity. Since $s$ and $g(I + I_r)$ are
independent, and the latter is described
by the Poisson distribution,
it will then follow from Equation 2
that:


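$$\sigma_E^2 = \sigma_s^2 g^2 (I + I_r)^2 + s^2 g (I + I_r) \qquad [3]$$


and consequently:


$$\Delta I = \frac{z\sigma_E}{sg} = z\left[v^2 (I + I_r)^2 + \frac{I + I_r}{g}\right]^{1/2} \qquad [4]$$


where $v = \sigma_s/s$ is the “coefficient of
variation” of the sensory message.

Since $z$, $g$, $v$, and $I_r$ are constants,
it can be seen that if $v < 1$, which is
plausible, then for $g(I + I_r)$ small
the function will approximate


$$\Delta I = z g^{-1/2} (I + I_r)^{1/2} \qquad [5]$$


and as $g(I + I_r)$ becomes large there
will be a transition to a function
approximating


$$\Delta I = zv(I + I_r) \qquad [6]$$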


Thus the assumptions made lead
immediately to the transition from a
square root law at low intensities to
the linear Weber function at high
intensities which required
explanation. We may note that the exponent
p in Equation 1 can now be
interpreted as a measure of the extent to
which positive correlation between
the sensory messages on each trial
determines the variance of E, and k
can be defined in terms of z, g, and v. 
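
The transition between the two regimes can be checked numerically. The sketch below takes the Weber function in the form ΔI = z·sqrt((I + I_n)/g + v²(I + I_n)²), which is how the relation underlying Equations 5 and 6 is read here, and estimates the local slope of log ΔI against log (I + I_n). The function names and all parameter values are illustrative assumptions, not values from the paper.

```python
import math


def weber_increment(I, I_n, g, z, v):
    """Full Weber function: z * sqrt((I + I_n)/g + v^2 (I + I_n)^2)."""
    x = I + I_n
    return z * math.sqrt(x / g + (v * x) ** 2)


def sqrt_law(I, I_n, g, z):
    """Low-intensity approximation (square root law, Equation 5)."""
    return z * math.sqrt((I + I_n) / g)


def linear_law(I, I_n, z, v):
    """High-intensity approximation (linear Weber function, Equation 6)."""
    return z * v * (I + I_n)


def local_exponent(I, I_n, g, z, v, step=1.01):
    """Slope of log(Delta I) against log(I + I_n), by a small finite step."""
    x = I + I_n
    lo = weber_increment(I, I_n, g, z, v)
    hi = weber_increment(x * step - I_n, I_n, g, z, v)
    return math.log(hi / lo) / math.log(step)


# Hypothetical parameter values, chosen only to make the transition visible.
params = dict(I_n=10.0, g=0.05, z=2.0, v=0.1)
for I in (1.0, 1e2, 1e4, 1e6, 1e8):
    print(I, weber_increment(I, **params), round(local_exponent(I, **params), 3))
```

With these values the exponent moves from about one half at the lowest intensities toward one at the highest, the change occurring around (I + I_n) = 1/(g v²).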

If this model correctly represents 
the phenomena which underlie the 
Weber functions, it will appear that 
Nature has been unkind to the experi- 
mental psychologist in burdening him 
with a relation dependent on four 
parameters. However this number 
does not present an insuperable bar 
to devising experimental tests of 
features of the model, as can be seen 
if the parameters are considered 
individually. 

1. I_n. Barlow (1957a) estimated
I_n by plotting log ΔI against log I
and taking the intensity of I at which
the curve fitted to these values
intersected the ordinate corresponding to
the absolute threshold as the dark
light; Gregory (1956) estimated I_n
by extrapolating the linear portion of
the curve relating ΔI to I to an
intercept with the abscissa. The estimates
obtained probably depended on rod
mechanisms in the former case and
cones in the latter; but if a Weber
function were determined with
precautions taken to make it depend on
the rods alone, then independent
estimates of I_n could be obtained from
the portion of the curve described by
Equation 5 and that described by
Equation 6, and we could examine
whether they are the same, as, on the
present assumptions, they should be.
Another procedure for estimating I_n
from absolute threshold judgments, 
which, however, is not very exact, is 
given by Barlow (1956). 

2. z. This parameter can be elimi- 
nated from Equation 4, or it can be 
estimated in two independent ways. 
(a) If instead of the incremental 
threshold we measure the differential 
threshold by the method of con- 
stant stimuli, requiring the responses 
“Greater” or “Less,” the resulting 
psychometric function will have its
50% point at I′, the standard
stimulus (ignoring any constant error).
If we take the standard deviation of
the psychometric function, σ_p, as ΔI,
this will correspond to σ_E, not to zσ_E,
so that z will disappear from
Equations 4, 5, and 6. Thus this
psychophysical procedure would give a
Weber function dependent on only
three parameters. (b) Crozier's law
(Crozier, 1936; Crozier & Holway,
1937; Holway, 1937; Holway &
Crozier, 1937; Upton & Crozier, 1936)
can be written ΔI/σ_ΔI = C, where C
is constant and σ_ΔI is the standard
deviation of ΔI (i.e., σ_p when the
method of constant stimuli is used).
This relation could be written
(I″ − I′)/(I‴ − I″) = C (see Figure
1), and this corresponds to
(E″ − E′)/(E‴ − E″) = zσ_E/σ_E = z.
When E is linearly related to I (for
the ranges used in determining
thresholds) C = z exactly, and even if the
relation is nonlinear it appears that,
in some cases, C may still not differ
much from z (Treisman, 1964c).
Thus, if the incremental threshold is
measured, C may be calculated and
taken as an estimate of z. (c) If
the false positive rate, P(Y|I′), is
recorded for the incremental
threshold, z can be determined by reference
to a table of the normal curve. 

It should be possible to estimate z 
by both procedures, and compare the 
two results obtained (though the 
large number of observations required 
to determine a small false positive 
rate accurately is a problem in prac- 
tice), or to use these estimates when 
attempting to fit Equations 5 or 6 
to experimental data. 

3. v. If z is known v could be
obtained from the slope of the linear
portion of the Weber function. Since
v = σ_s/s, any factor which would tend
to increase the variability of the
neural response would be expected to
increase the slope of the linear portion
of the Weber function. Age might
be thought to be such a factor, a
prediction made and tested by Cane
and Gregory (1957), who found that
the slope, but not the intercept with
the abscissa, of the linear portion of
the curve increased in older subjects;
they interpreted this as indicating an
increase in neural noise but not in
retinal noise (I_n).

4. g. This constant relates the
number of quanta effectively
absorbed to the intensity of the stimulus.
It will depend on the area and
duration of the stimulus, and on f, the
fraction of the quanta incident at the
cornea which is effectively absorbed
by the visual pigment. The value of
f can be estimated by consideration of
the physical and chemical properties
of the eye and retina; Rushton (1956)
estimates that 10% of light entering
the eye (λ = 505 mμ, 20° parafoveal)
is absorbed by rhodopsin. Thus
estimates of g, derived in this way, and
I_n could be substituted in Equation 5
to predict the exact form of the lower
portion of the Weber function; or a
value of g could be derived from the
threshold data, applying Equation 5,
and compared with that given by
physical considerations.

A comparison of Equations 5 and 6
will suggest another prediction: since
f, and therefore g, is a function of
wave length, and g is present in the
former equation but not the latter,
wave length should affect the Weber
function for low but not for high
stimulus intensities. This expectation
is supported by observation: “At
high intensities the Weber fraction
ΔI/I of the cone system as a whole
becomes constant (Weber's law)
and has about the same value for light
of all spectral compositions [Pirenne,
1962].”

Since g depends on the area and
duration of the stimulus, a further
test of the model would be given by 
seeing whether predictions that can 
be derived from Equation 4 for varia- 
tion in the area and duration of the 
stimulus agree with experimental find- 
ings on spatial and temporal summa- 
tion. This question will be examined 
more closely in a later section, but 
first further consideration will be 
given to the two assumptions made in 
deriving the model. 
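
Two of the estimation procedures described under parameters 1 and 2 above lend themselves to a short numerical sketch. The first extrapolates the linear portion of the Weber function to its intercept with the abscissa, which under ΔI = zv(I + I_n) lies at I = −I_n. The second converts a false positive rate into z, assuming (as the reference to a table of the normal curve suggests) that the central effect of the background is normally distributed and the criterion lies z standard deviations above its mean. The function names and the synthetic data are hypothetical, introduced only for illustration.

```python
from statistics import NormalDist


def fit_line(xs, ys):
    """Ordinary least-squares line through (xs, ys); returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def dark_light_from_linear_portion(intensities, thresholds):
    """Estimate I_n as minus the abscissa intercept of the fitted line."""
    slope, intercept = fit_line(intensities, thresholds)
    return intercept / slope


def z_from_false_positive_rate(p_false_positive):
    """Criterion distance z such that P(Y | I') = 1 - Phi(z)."""
    if not 0.0 < p_false_positive < 1.0:
        raise ValueError("false positive rate must lie strictly between 0 and 1")
    return NormalDist().inv_cdf(1.0 - p_false_positive)


# Synthetic thresholds generated from the linear law itself (assumed values).
z_true, v_true, I_n_true = 2.0, 0.1, 50.0
intensities = [1e3, 2e3, 5e3, 1e4]
thresholds = [z_true * v_true * (I + I_n_true) for I in intensities]

print(dark_light_from_linear_portion(intensities, thresholds))
print(z_from_false_positive_rate(0.025))
```

The second estimate is only as good as the recorded false positive rate, which, as noted above, needs a large number of observations when the rate is small.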


Visual Pathways 


Relatively little is known of the 
detailed functioning of the afferent 
pathways; nevertheless, if the two 
assumptions on which Equation 4 is 
based are to be accepted they must 
be compatible with what is known of 
sensory physiology. They are, of
course, simplifications which need not 
hold exactly at the physiological 
level. Linearity need not be perfect 
nor correlation complete for Equation 
4 to fit the data to a good approxima- 
tion. In this section some of the 
possible ways in which neural mecha- 
nisms underlying vision might func- 
tion will be discussed, mainly in order 
to demonstrate that the hypotheses 
given above could describe the be- 
havior of physiological bases which 
are not completely implausible, and 
without wishing to imply that other
physiological assumptions could not
be made which would be equally or 
more acceptable. 

We start by supposing that the 
effective absorption of a quantum of 
light causes a train of impulses to be
transmitted from the receptors,
through bipolar cells, to the ganglion 
cells, where the different trains set 
up by the absorption of quanta from 
a given flash sum to give E, the linear 
relation given in Equation 2 applying 
to this transmission. The neural 
noise arises from variation in
transmission at the retinal synapses, i.e.,
variation in the number of impulses 
produced in a bipolar cell when a 
quantum of light is absorbed by its 
receptor, or in the number of post- 
synaptic impulses produced in a 
ganglion cell by the arrival of a 
constant presynaptic train. This 
variation in transmission is the result 
of the presynaptic train being added 
to a pre-existing level of facilitation 
at the synapse, this level varying 
from trial to trial, and the sum of the 
two effects determining the postsynap- 
tic response. The facilitation might 
be due to a bombardment of all the 
synapses, produced by a single central 
source whose output varies, and 
mediated by centrifugal fibers (Granit, 
1955). When the output of this 
source is high transmission will be 
good at all the synapses, when it is 
low each quantum absorbed will make 
a smaller contribution to the central 
effect than usual, so that the central 
effects of the different quanta ab- 
sorbed from any one flash will not be 
independent but will show some de- 
gree of positive correlation. Provided 
that the greater proportion of the 
neural noise is assumed to arise at 
this linear stage in the retina it would 
not be necessary to require the gang- 
lion cell response also to be a linear 
function of I. If, for example, the
correlated noise occurs before a loga- 
rithmic transformation, application 
of the Neyman-Pearson criterion to 
the latter would have the same conse- 
quences as if it were applied directly 
at the linear stage, and Equations 5 
and 6 would follow as before. 
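
The effect of a common source of facilitation on the variance of E can be illustrated by simulation. In the sketch below the number of quanta absorbed on a trial is Poisson with mean λ = g(I + I_n). In the "correlated" case every quantum on a trial receives the same randomly varying message, which is a limiting simplification of correlations approximating unity; in the "independent" case each quantum draws its own message. The Gaussian form of the message, the parameter values, and the function names are assumptions made for the illustration.

```python
import math
import random
from statistics import pvariance


def poisson(rng, lam):
    """Poisson variate by Knuth's multiplication method (fine for modest means)."""
    limit = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= limit:
            return k
        k += 1


def simulate_variance(lam, s_mean, s_sd, correlated, trials=5000, seed=0):
    """Variance of the central effect E over simulated trials."""
    rng = random.Random(seed)
    effects = []
    for _ in range(trials):
        n = poisson(rng, lam)
        if correlated:
            # One shared message size for all quanta absorbed on this trial.
            effects.append(n * rng.gauss(s_mean, s_sd))
        else:
            # Each quantum's message varies independently.
            effects.append(sum(rng.gauss(s_mean, s_sd) for _ in range(n)))
    return pvariance(effects)


lam, s_mean, s_sd = 50.0, 1.0, 0.3
# Two-term expression with a quadratic noise term, versus the independent case.
correlated_prediction = s_sd ** 2 * lam ** 2 + s_mean ** 2 * lam
independent_prediction = (s_sd ** 2 + s_mean ** 2) * lam

print(simulate_variance(lam, s_mean, s_sd, True), correlated_prediction)
print(simulate_variance(lam, s_mean, s_sd, False), independent_prediction)
```

Only the correlated case produces a variance term growing as the square of λ, which is what yields a linear Weber function at high intensities; with independent messages the variance stays proportional to λ and the square root law would hold throughout.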

Some evidence on the transmission 
of the afferent message from the re- 
ceptor to the ganglion cell is provided 
by neurophysiological observations 
which show that Riccò's law holds 
for single receptive fields of the frog 
and cat, when the effect of light is
measured by its ability to evoke a
threshold or constant suprathreshold
discharge from the ganglion cell
(Barlow, 1953; Barlow, FitzHugh, &
Kuffler, 1957; Hartline, 1940). Within
limits, Q = AI is constant as A, the
area of the stimulus, varies, which is 
taken to indicate that the effects on 
the receptors sum at the ganglion 
cell. If the mean quantity of light, 
Q, applied to a receptive field is con- 
stant, and on any occasion it is dis- 
tributed over r receptors, where r 
may vary from trial to trial, then 
each receptor will receive a quantity 
of light Q/r, and will transmit a 
message to the ganglion cell which is 
a function of this quantity, f(Q/r). 
The total effect produced at the 
ganglion cell by a given presentation 
of the stimulus will then be r × f(Q/r).
This expression will only be constant
as r varies if f represents
multiplication by a constant; therefore, since
Riccò's law holds for the ganglion 
cell response at levels of illumination 
at which each receptor is likely to 
absorb more than one quantum, this 
indicates that the transmission from 
the receptors to the ganglion cell 
must be effectively linear. 
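
The argument can be made concrete with a few lines of code. The sketch compares the summed effect r × f(Q/r) for a linear transmission function and for a compressive (logarithmic) one as the same quantity of light Q is spread over more receptors. The particular functions and values are illustrative assumptions.

```python
import math


def total_effect(Q, r, transmit):
    """Summed input to the ganglion cell when light Q is spread over r receptors."""
    return r * transmit(Q / r)


def linear(q, gain=0.5):
    """Transmission as multiplication by a constant."""
    return gain * q


def compressive(q):
    """A nonlinear alternative, used here only for contrast."""
    return math.log1p(q)


for r in (1, 2, 5, 10):
    print(r, total_effect(100.0, r, linear), total_effect(100.0, r, compressive))
```

Only the linear function leaves the total unchanged as r varies; with the compressive function the total grows with r, so a constant threshold quantity Q would not be found.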

A difficulty which would arise if 
this model were accepted, with the 
discharge from the ganglion cell taken 
to be logarithmically related to stimu- 
lus intensity (or to have some other 
nonlinear monotonic relation to it), 
is that since log Σs ≠ Σ log s, the
model would then be adequate only 
for stimuli whose area falls within one 
receptive field, as stimuli larger than 
this would require the logarithmic 
outputs from two or more fields to 
be summed centrally; consequently,
it would become difficult to explain
the similar laws for spatial and tem- 
poral summation (see the next sec- 
tion) when large areas are used. (The 
same difficulty arises if an attempt is 
made to attribute Weber's law to the 
addition of constant variance after a 
logarithmic transformation.) But it 
is not plausible, on the other hand, to 
suppose that Equation 2 applies 
exactly to the ganglion cell discharge 
over the whole range of light intensi- 
ties. Fortunately, it is not necessary 
to make so strong an assumption. 
Since in determining visual thresholds 
the eye is usually adapted to a fixed 
intensity of I, to which near-threshold
increments, or decrements, are added,
we need only postulate approximate
linearity for near-threshold ranges in
the adapted condition. Thus the
effect of adaptation to a constant
stimulus, I, might be to eliminate the
response to a constant proportion of
the mean number of quanta absorbed
from it, hg(I + I_n), h < 1 being
constant, so that only quanta
absorbed in excess of this effective zero
point on any trial would produce a
neural message, with the variance of
g(I + I_n), and near-threshold
increments or decrements in I, producing
approximately linear changes in this
message. On this assumption
Equation 5 would be unchanged, and
Equation 6 would become
ΔI = zv(1 − h)(I + I_n), which is again
a linear function. Or we might
suppose that when the eye is adapted to I
a neural message, given by s
log hg(I + I_n), is produced, to which
any excess of stimulation adds a
linear increment. On this assumption
Equation 4 would become


AI == Vg E A Ly) + Ped = I F log hg T I 


me 
and this again would give Equation 5
for small intensities of g(I + I_n); as
intensity increases g²(I + I_n)² would
grow more rapidly than g(I + I_n) or
log g(I + I_n), with the result that
at high stimulus intensities the term
giving a linear function would again
be the most important in the equation.

On these assumptions the variance 
due to neural noise could arise at the 
retina or at higher synaptic junctions, 
being caused by a single source of 
facilitation which might be identified 
with “reticular arousal,” or in some
other way. 

The assumption that small changes 
in intensity about a level to which the 
eye is adapted will cause approxi- 
mately linear changes in the ganglion 
cell discharge receives some support 
from observations made by FitzHugh 
(1957). He examined the effect of 
brief flashes of light of near-threshold 
intensity on the discharge from the 
cat's ganglion cell, recording the rela- 
tion between the number of impulses 
produced in a critical period following 
a flash and the intensity of the latter; 
he fitted power functions to the data 
for individual cells and obtained mean 
exponents of .6-1.2. There is also
some psychophysical evidence tending 
to support him (Treisman, 1964c). 

There is little evidence to indicate 
whether or not sensory noise is corre- 
lated. However, an ingenious experi- 
ment by Diercks and Jeffress (1962) 
is of interest. When a tonal signal 
is presented in white noise the thresh- 
old for the tone is lower the greater 
the disparity between the interaural 
correlations for the tone and for the 
white noise (Hirsh, 1948; Jeffress, 
Blodgett, Sandel, & Wood, 1956). 
Thus if the tone and noise are both 
in phase at the two ears, the threshold 
will fall if the tone at the input to one 
ear is removed, and will fall further 
if it is returned to that ear but 180°
out of phase with the tone on the
other ear. Diercks and Jeffress meas- 
ured the absolute threshold for a 
250 cps tone given alone, and found 
that the threshold was higher when 
the signals to the two ears were in 
phase than when they were 180° out 
of phase, which they interpret as 
indicating that the signal is added to 
internally generated noise with some 
degree of positive interaural cor- 
relation. 

A square root law was first sug- 
gested by Cattell (1893), who argued, 


The algebraic sum of a number of variable 
errors tends to increase as the square root 
of the number. In measuring the base line 
of a survey the variable error of observation 
increases as the square root of the length of 
the line. It seems to me the same relation 
might be expected to hold in a general way 
when the length of a line is estimated by the 
eye. . . . I should substitute for Weber's law 
the following: The error of observation tends to 
increase as the square root of the magnitude. 


Fullerton first suggested that correla- 
tion between “fractional or elemental” 
errors of observation might contribute 
to the appearance of Weber's law 
(Cattell, 1893); later Woodworth 
(1914) argued that “The reaction of 
the organism to a stimulus is always 
a complex activity, composed of 
many elementary activities,” and that 
these might be correlated. However, 
perhaps in part because of a fallacious 
rebuttal by Guilford (1932), this 
suggestion dropped out of sight. 


Spatial and Temporal Summation in 
Vision 

The energy in a light flash can be 
varied by altering its intensity, area 
or duration. Any account of the 
effect of change in I on ΔI should,
therefore, be capable of being ex- 
tended to cover spatial and temporal 
effects in similar detail. 

The main features of spatial and 
temporal summation are similar. For
sufficiently small areas the relation
of the absolute threshold, I_0, to the
area of the stimulus, A, is given by
I_0 A = c (Riccò's law). There is
then a transition range of area for
which Piper's law, I_0 A^{1/2} = c′, is
sometimes claimed to hold, and for large
areas the threshold is independent of
area, i.e., I_0 A^0 = c″ (c, c′, and c″
being constants). These three
relations represent complete summation,
partial summation, or the absence of
summation, the degree of summation
being indicated by the exponent of A;
this exponent can be represented as
the slope of the curve relating log I_0
to log A. Similarly, when the
duration, t, of the stimulus is varied,
complete temporal summation is obtained
for short durations, i.e., Bloch's law
(the Bunsen-Roscoe law), I_0 t = k,
holds. Partial temporal summation
is obtained in a transition range,
and for long durations I_0 = k″ (k
and k″ being constants). Change in
area or duration has similar effects
on the difference threshold (Barlow,
1958b; Brindley, 1960; Le Grand,
1957).

FIG. 2. Data obtained by Barlow (1958b).
(Log [increment threshold intensity] is
plotted against log [duration] for a small
[.011 degree²] and a large [27.6 degree²]
area of stimulus at five different
background intensities. The straight lines
have slope −1 and are continued up to
.1 second. Intensities are given in
quanta [507 mμ]/second degree²,
durations in seconds.)

Evidence supporting Piper's law 
or the corresponding square root law 
for time has been obtained by a 
number of authors (Baumgardt, 1959; 
Bouman & van der Velden, 1947; van 
der Velden, 1944; Willmer, 1950). 
However, deviations from these laws 
are frequent (Brindley, 1960; Le 
Grand, 1957); they are well illustrated
by Barlow's (1958b) data (see Figure
2). He measured absolute and
incremental thresholds varying both t and
I, the background intensity, for
different values of A, or varying A and I
for different values of t. His results
show that, for temporal summation,
increase in either the background
intensity or the area of the stimulus
will tend to reduce the duration at
which complete summation ends and
the transition range begins, and the
duration at which partial summation
ends, the transition range becoming
shorter, and the mean exponent of t
in this range decreasing. The effects
of increase in t or in I on spatial
summation were very similar. The
form of the transition ranges appears
to accord with Graham, Brown, and
Mote's (1939) finding that the
exponent is not constant in this range
but decreases “as one progresses from
the smallest area to the largest area.”
Thus it appears that increase in I, A,
or t all tend to reduce the exponent of
A or t. This gradual change in
exponent is quite compatible with the
approximate adherence to a square
root law over part of the transition
range sometimes reported.
Brindley (1960) argues,


If the relevant noise is assumed to be propor- 
tional to the area and duration of the stimulus, 
as is reasonable for large and long stimuli, the 
threshold should be inversely proportional to 
the square roots of area and duration (Piper's 
law and its temporal counterpart). . . . Since 
[these laws] are not exactly true, the hy- 
pothesis of optimal discrimination of signal 
from noise cannot be exactly correct. 


The form of the signal detection 
model he thus questions is that based 
only on background and physical 
noise. This model also cannot ex- 
plain why, for still larger areas or 
longer durations, the threshold be- 
comes independent of area or dura- 
tion. This makes it of interest to 
see whether the model developed 
here, which takes account of neural 
noise as well as the other sources, 
can account for summation more 
adequately, and thus avoid Brindley's 
stricture. 

If, on any trial, an increment ΔI
is added to a background stimulation
of constant intensity I′, whether or
not it is seen will depend on whether
the value of E produced by (I′ + ΔI)
exceeds c_E = E + zσ_E. The values
of E and c_E depend on the area and
duration of the background
stimulation which the decision device
samples. It is often assumed that there
is an area a and a duration τ below
which complete summation occurs; in
terms of the present model these would
be the minimum area and duration
sampled in reaching a threshold
decision. If the area of the increment is
A″ and its duration t″, it is simplest
to assume that the area and duration
sampled on each trial will be A′ = a
or A″, whichever is the larger, and
t′ = τ or t″, whichever is the longer.
This implies that g must be separately
defined for I′ and ΔI; g relates the
number of quanta which are
effectively absorbed and contribute to the
decision process to the intensity of the
stimulus, and it will depend on the
area and duration contributing quanta
and on f, the fraction of the quanta
incident at the cornea which is
effectively absorbed by the visual pigment
(Pirenne, 1956; Rushton, 1956). For
I′ (in quanta/second degree² entering
the eye), g will be given by fA′t′, and
for ΔI by fA″t″. We can now
rewrite Equation 4 as


\Delta I = \frac{z}{f A'' t''} \sqrt{f A' t' (I' + I_n) + v^2 (f A' t')^2 (I' + I_n)^2} \qquad [7]


For I′ and t″ constant, and A″ < a
variable, the expression under the
square root sign can be written as a
constant, c, giving


\Delta I = \frac{z \sqrt{c}}{f A'' t''} \qquad [8]


which is, of course, Riccò's law. For
A″ > a Equation 7 can be simplified
in the same way as Equation 4,
giving:


\Delta I = \frac{z}{t''} \sqrt{\frac{t' (I' + I_n)}{f A''}} \qquad [9]


for fA′t′(I′ + I_n) small, and:


\Delta I = \frac{z v t'}{t''} (I' + I_n) \qquad [10]


for fA′t′(I′ + I_n) large. Equation 9
is Piper's law, and Equation 10 makes
the threshold independent of area.
Thus as fA′t′(I′ + I_n) increases there
will be a continuous transition
between these two laws. An exactly
similar argument applies to temporal
summation.

Thus the present model accounts
for the inadequacy of the square root
laws as descriptions of partial spatial
and temporal summation, the
decrease of the mean exponent in, and
the extent of, the transition range
as A″, t″, or I′ increase and the
decrease in the exponent as we
go from smaller to larger areas or
durations in the transition range.
It explains why the duration at which
the temporal transition range ends is
shorter when A″ or I′ are large, and
the area at which the spatial range
ends is smaller for t″ or I′ large, and
it accounts for the independence of 
area or duration found when the 
latter are very large. 
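
The passage from complete summation through a square root law to independence of area can be traced numerically. The sketch below assumes the threshold has the form ΔI = z·sqrt(n + v²n²)/(fA″t″), with n = fA′t′(I′ + I_n), A′ = max(a, A″) and t′ = max(τ, t″), which is the reading of Equation 7 adopted here. It reports the local slope of log ΔI against log A″. All names and parameter values are hypothetical.

```python
import math


def increment_threshold(A_inc, t_inc, I_bg, *, a, tau, f, z, v, I_n):
    """Threshold increment for an increment of area A_inc and duration t_inc."""
    A_sampled = max(a, A_inc)    # area sampled by the decision process
    t_sampled = max(tau, t_inc)  # duration sampled by the decision process
    n = f * A_sampled * t_sampled * (I_bg + I_n)  # mean quanta from background
    return z * math.sqrt(n + (v * n) ** 2) / (f * A_inc * t_inc)


def area_exponent(A_inc, step=1.01, **fixed):
    """Slope of log(Delta I) against log(area), by a small finite step."""
    lo = increment_threshold(A_inc, **fixed)
    hi = increment_threshold(A_inc * step, **fixed)
    return math.log(hi / lo) / math.log(step)


# Illustrative values only; units follow the text (degree^2, seconds).
fixed = dict(t_inc=0.05, I_bg=1000.0, a=0.01, tau=0.1,
             f=0.1, z=2.0, v=0.05, I_n=100.0)
for A in (0.001, 0.1, 10.0, 1e3, 1e6):
    print(A, round(area_exponent(A, **fixed), 3))
```

With these values the slope is −1 below the area a, close to −1/2 just above it, and approaches 0 for very large areas, with no plateau at −1/2 in between. Raising I_bg in the sketch moves the whole transition to smaller areas.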

A number of authors have found 
that increase in A” will reduce the 
amount of temporal summation,
increase in t″ will reduce the amount of
spatial summation, and an increase in 
I’ will reduce both spatial and tem- 
poral summation (see Barlow, 1958b). 
The results in Figure 2 accord with 
this; Barlow (1958b) notes that the 
square root laws are never simul- 
taneously true for all three variables— 
to obtain the square root law for 
intensity, for example, areas and 
times within their ranges of complete 
summation must be used. This 
agrees with the implications of Equa- 
tion 7, since the exponent obtained 
depends on the magnitude of the 
product A′t′(I′ + I_n); this product
is more likely to be small and so to 
give a small exponent if the variables 
held constant are small than if they 
are large. 

If a can vary from time to time, 
this would explain the occasional ex- 
ponent smaller than 1 but larger than 
.5 (Brindley, 1960). It would give a 
range of areas governed sometimes by 
Equation 8 and sometimes by
Equation 9 and having, therefore, an
intermediate exponent. That a and
τ are not invariable is suggested by
the decrease in the area of total
summation as t″ or I′ increase, or in the
duration of total summation as A″ or
I′ increase (Barlow, 1958b).


The extent of spatial summation in the living 
human eye is in broad agreement with the 
size of the retinal area covered by the ramifi- 
cations of the large ganglion cells which 
presumably are responsible for this summa- 
tion [Pirenne, 1956], 


and Barlow, FitzHugh, and Kuffler 
(1957) have shown that lateral in- 
hibition is reduced in the cat's retina 
in dark adaptation. If the retina can 
be considered to consist of “sensitive
units” (Brindley, 1960; van der 
Velden, 1944), i.e., receptive fields 
summating input over a period of 
time, it is possible that mechanisms 
might exist tending to prevent the 
output of a unit, faτ(I + I_n),
exceeding some upper limit, so that if
A″, t″, or I′ increase there will be
“feedback” tending to reduce a and
τ. In the case of a this might operate
through increase in lateral inhibition. 

The threshold intensity is inde- 
pendent of the area and duration of 
the stimulus when the latter are large, 
and this is sometimes taken to mean 
that there is an upper limit of area 
or time beyond which summation 
ceases; the threshold model based 
only on physical and background 
noise requires such an assumption to 
explain why the square root law does 
not apply to large areas and durations. 
Presumably the stimulus information 
beyond these limits is supposed not to 
contribute to threshold decisions; 
thus the smaller the limit the less 
efficient the eye. Barlow (1958b) 
believes his results indicate: 


that big changes in the amount of temporal 
and spatial summation occur when the back- 
ground intensity is changed, with the result 
that when thresholds are determined with a 


long duration large area test stimulus the 
experimental points deviate from the ap- 
propriate theoretical curve and tend to obey 
the Weber law instead. 


Here the term summation is used 
descriptively to refer to the occurrence 
of a fall in threshold intensity as area 
or duration increases. The present 
arguments show that we need not 
attribute variation in summation, in 
this sense, to an equivalent variation 
in the action of an underlying sum- 
mating mechanism with built-in upper 
limits. On the present model there is 
no upper limit to the area or duration 
of the stimulus effectively utilized, 
and variations in such limits are not 
needed to account for the different 
Weber functions obtained when condi- 
tions vary. Information is always 
drawn from the whole area and dura- 
tion of the stimulus, and threshold 
discrimination is limited only by the 
variability of the processes involved, 
as indicated by Equation 7. 


Other Dimensions 


In the study of sensory function 
approximations to Weber's law are 
ubiquitous, so that an explanation of 
the function will appear more plaus- 
ible if it is not peculiar to a particular 
modality but is fairly general in its 
application. The present
assumptions satisfy this requirement fairly
well. As an example we can consider 
the visual discrimination of the length 
of lines. We suppose that the pre- 
sentation of the lines is sufficiently 
well controlled for variation in the 
visual angle subtended by a given 
line to be negligible. When a line of 
length L is presented, it excites visual 
“line-detecting elements” whose
number is given by d = kL, where k is a
constant. In the absence of L,
spontaneous activity which can be
confused with the effect of the line may
occur, exciting kL_n line-detecting
elements. An active element produces
a sensory message s, with variance
σ_s²; the central effect produced by a
presentation of L will therefore be


E = \sum_{i=1}^{k(L + L_n)} s_i


If all the intercorrelations, r_ij,
between the sensory effects
produced each time the line is
presented approximate to unity, then
the variance of E will be σ_E²
= k²σ_s²(L + L_n)², and consequently
the Weber function will be ΔL
= zv(L + L_n), where v = σ_s/s, which
is the linear generalization of Weber's
law. 

The Weber function for temporal 
discrimination has been derived in a 
similar way (Treisman, 1963a). The 
linearity and correlation assumptions 
are so general that they could be 
applied to predicting the variance of 
nonsensory data. Crossman* has 
noted that the standard deviation of 
reaction times tends to be a constant 
proportion of the mean, between 10 
and 25%, a relation analogous to 
Weber's law. We might suppose that 
the length of a reaction time is mainly 
due to the occurrence of a greater or 
lesser number of elementary neural 
events of similar duration, such, 
perhaps, as synaptic crossings, which
intervene between the stimulus and 
response, so that the reaction time 
is linearly related to this number. 
The duration of the elementary events 
which sum to give the reaction time 
varies in highly correlated fashion 
from trial to trial: this will lead to 
“Crossman’s law” in the same way 
that Weber's law was obtained above. 
This explanation need not conflict 
with the hypothesis that reaction time 
is determined by the rate at which 
information can be processed (Hick, 
1952), or the time necessary to execute 
a sequential decision procedure (Stone, 
1960); we could suppose that these 
procedures are performed by means
of the “elementary neural events”
postulated above and determine the
number necessary, and that variation
in the duration of these events is the
major source of response variance.
However, it is of interest that Stein
(personal communication, 1963)
applies the negative binomial distri- 
bution to reaction times and finds 
that under certain conditions the 
standard deviation will approximate 
a constant proportion of the mean. 
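
The suggested origin of "Crossman's law" can be illustrated by simulation. In the sketch below a reaction time is the sum of n elementary durations. When the durations on a trial share a common value (the limiting case of highly correlated variation) the standard deviation stays a constant proportion of the mean as n changes; when they vary independently the proportion shrinks as n grows. The Gaussian durations, their mean and spread, and the function names are assumptions made for the illustration.

```python
import random
from statistics import mean, pstdev


def reaction_times(n_events, correlated, trials=2000, mu=5.0, sd=0.75, seed=1):
    """Simulated reaction times built from n_events elementary durations."""
    rng = random.Random(seed)
    times = []
    for _ in range(trials):
        if correlated:
            # All elementary events on a trial share one duration.
            times.append(n_events * rng.gauss(mu, sd))
        else:
            # Each elementary event has its own independent duration.
            times.append(sum(rng.gauss(mu, sd) for _ in range(n_events)))
    return times


def coefficient_of_variation(times):
    """Standard deviation as a proportion of the mean."""
    return pstdev(times) / mean(times)


for n in (10, 40, 160):
    print(n,
          round(coefficient_of_variation(reaction_times(n, True)), 3),
          round(coefficient_of_variation(reaction_times(n, False)), 3))
```

The chosen spread (sd/mu = 0.15) places the correlated case inside the 10 to 25% range mentioned above; that choice is arbitrary and is not an estimate from data.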

The account of the Weber function
that has been given is necessarily a
simplification, since it is unlikely
that in actual cases the assumptions
made would be wholly met. It has
the advantage of being sufficiently
general to provide an
explanation for any sensory dimension, rather
than describing a special case only.
It is to be hoped that relating Weber's
law to
the signal detection model of the
threshold may serve as a basis for
clarifying its implications for the
troubled field of sensory scaling
(Treisman, 1961, 1962, 1963b, 1964b, 1964c).


SUMMARY 


The signal detection model fo 
threshold discrimination is outline 
and applied to some familiar 
cedures for measuring thresholds. £ 
attempt is made to use this model to 
explain the form of the Weber func- 
tion. Three sources of noise limitin: 
discrimination are discussed: the i 
reducible physical variability of 
stimulus; the spontaneous bac! 
ground” noise to which the stimulus 
can be considered to be added; and 
neural noise arising from variatio 
in the response of the pathways trans 
mitting the sensory message centrall! 
Two assumptions are added to 
signal detection model: (a) that 


R. B. Stein, personal communica’ 
1963. 
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central effect of the stimulus serv- 
ing as decision axis approximates a 
linear function of stimulus intensity 
over the threshold range; (b) and 
that the neural noise arises not from 
independent, but from positively cor- 
related sources. With these additions 
the model explains both the approxi- 
mation to Weber's law which is ob- 
tained at high stimulus intensities, 
and the departures from the law 
which occur at low intensities, for 
visual intensity discrimination and 
other dimensions, as well as features 
of visual spatial and temporal 
summation. 
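
The role of assumption (b) can be illustrated with a small numerical sketch. It assumes, hypothetically, that the number of active noise sources n grows in proportion to stimulus intensity and that each source adds the same amount to the mean response. The function names and values are illustrative only and are not the model's own equations. The standard deviation of a sum of n equally correlated sources is sqrt(n·σ² + n(n − 1)·ρ·σ²), so with independent sources (ρ = 0) the SD grows as the square root of n, while with positively correlated sources it approaches proportionality to n.

```python
import math

def sd_of_sum(n, sigma, rho):
    """SD of the sum of n noise sources, each with SD sigma and with a
    common pairwise correlation rho (0 = independent, 1 = fully correlated)."""
    variance = n * sigma**2 + n * (n - 1) * rho * sigma**2
    return math.sqrt(variance)

def relative_sd(n, sigma, rho, unit_mean=1.0):
    """SD of the summed response divided by its mean, assuming (hypothetically)
    that each source contributes unit_mean to the mean response."""
    return sd_of_sum(n, sigma, rho) / (n * unit_mean)

# Compare independent (rho = 0) and positively correlated (rho = 0.25) noise.
for n in (4, 100, 10000):
    print(n, round(relative_sd(n, 1.0, 0.0), 4), round(relative_sd(n, 1.0, 0.25), 4))
```

With correlated sources the relative SD levels off at a constant for large n, as in Weber's law at high intensities, and is larger for small n, as in the departures at low intensities. With independent sources it keeps falling.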
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THEORETICAL NOTE 


HOW SINCERE IS THE DOGMATIST? 


GORDON STANLEY AND JOHN MARTIN
University of Western Australia 


Rokeach has proposed two hypotheses as alternatives to the acquies-
cence explanation of the double-agreement phenomenon. The first of
these (B1) states that the respondent when replying to the reversed
scale “sees a good reason why he should not tell the truth. . . . he
(thus) tells the truth to the original statement and he deliberately lies
to the reversed statement.” This is interpreted as implying that re- 
sponses to a reversed scale of dogmatism, authoritarianism, or anti- 
Semitism would correlate positively with scores on social desirability 
or lie scales. Empirical data are advanced to refute this hypothesis. 


Rokeach (1963) has recently proposed 
two new hypotheses to explain acquies- 
cence or the double-agreement phenome- 
non. Until now this phenomenon has 
been interpreted largely in terms of re- 
sponse style, but Rokeach’s hypotheses 
return to a consideration of content. The 
first of these, Hypothesis B1, states that,


A person may agree with a statement and 
with its opposite because in both instances 
he reads them, comprehends them and re- 
sponds to their content. He tells the truth 
in one case because he sees no reason why 
he should not; he deliberately lies in the 
second case because he sees a good reason 
why he should not tell the truth [Rokeach, 
1963, pp. 304-305]. 


He considers that the subject agrees with 
the positively worded statement because 
its “antidemocratic tone” is not apparent. 
Agreement with such an item will indi-
cate authoritarianism, anti-Semitism, or
dogmatism as the case may be. When
presented with the reversed item, the sub-
ject wishes to disagree with it but fears
that so doing may produce a “bad” im-
pression on the experimenter; conse-
quently he lies in response to the re-
versed statement and produces double 
agreement. 

Rokeach says that the converse of this 
hypothesis, viz. that the subject lies in 
response to the original items and tells 
the truth in response to the reversals is 
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psychologically meaningless and hence 
improbable. 


Hypothesis B1 advances no explanation
of the double-disagreement phenomenon 
(Peabody, 1961) but may be logically 
extended to say that the nonauthoritarian 
(pro-Semitic, nondogmatic) subject re- 
sponds truthfully to the original and un- 
truthfully to the reversal. If this exten- 
sion is entertained it is apparent that a 
psychological explanation for these sub- 
jects’ disagreement with the nonauthori- 
tarian (pro-Semitic, etc.) item could also 
depend on their unwillingness to make a 
bad impression. It is not psychologically 
meaningless to say that subjects who are 
nonauthoritarian (etc.) may feel that
these attitudes are not socially acceptable 
and may seek to conceal them by giving 
insincere replies to the negative item. 

These hypotheses, together with other 
variations on the theme raised by Ro-
keach's Hypothesis B1, may be conven-
iently tested by observing the correlation
between responses to positive and re- 
versed scales (of the specified type) and 
suitable measures of insincerity, e.g., so- 
cial desirability and lie scales. 

The various hypotheses are: 


1. Insincerity appears only among the 
acquiescent group 
(a) in relation to the negative scale 
(Rokeach’s Hypothesis B1).
(b) in relation to the positive scale 
(converse of above). 
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2. Insincerity appears only among the 
dissenting group 
(a) in relation to the negative scale. 
(b) in relation to the positive scale. 


3. Insincerity appears among all sub- 
jects 
(a) in relation to the negative scale. 
(b) in relation to the positive scale. 


The following data serve to test these 
hypotheses.


AN EMPIRICAL TEST


A reversed scale of dogmatism (D—) 
was constructed based on the same ra- 
tionale as Rokeach (1956) and prima 
facie measuring the same psychological 
variable. The wording of the original 
was departed from when it was consid- 
ered that this allowed the production of 
a more satisfactory item (Christie, Havel, 
& Seidenberg, 1958), otherwise reversal 
was obtained by simple grammatical 
changes. 

Research results (e.g., Bendig, 1962; 
Jackson & Messick, 1962) suggest that 
social desirability and lie scales are the 
best available measures of insincerity. 

In this study the Martin Social Desira- 
bility (SDB) Scale, construction of 
which is being reported elsewhere, was 
used as the indicator of social desira- 
bility. This scale employs 23 positive 
and 23 negative items balanced for acqui- 
escence. Martin has found reliabilities 
(Cronbach’s alpha) for this scale of .84, 
.77, .75, and .70 with different groups.
The lie (L) scale of the MPI (Gibson, 
1962) was used as an indicator of faking 
or lying. Gibson reported the split-half 
reliability of this scale to be .66 (Gutt- 
man's formula). Reliabilities found for 
the present sample are given in Table 1. 
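
For readers unfamiliar with the reliability index quoted here, the following is a minimal sketch of how Cronbach's alpha is computed from item scores: alpha = k/(k − 1) × (1 − sum of item variances / variance of total scores). The function name and the toy data are hypothetical and do not come from the scales used in this study.

```python
from statistics import variance

def cronbach_alpha(scores):
    """scores: list of rows, one row of item scores per respondent."""
    k = len(scores[0])                       # number of items
    items = list(zip(*scores))               # column-wise item scores
    item_var_sum = sum(variance(col) for col in items)
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - item_var_sum / total_var)

# Toy data (illustrative only): three respondents, two items.
print(cronbach_alpha([[1, 2], [2, 1], [3, 3]]))
```

Alpha rises toward 1 as the items covary more strongly relative to their individual variances.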

These three measures, together with 
Rokeach’s Dogmatism scale (D+) were 
administered to Psychology I students at 
the University of Western Australia (N 
= 127). Tests were administered in a 
special test program, the order of admin- 
istration of D+ and D— being counter- 
balanced and separated in time by ap- 


1 Copies of the scale are available from 
the authors on request. 
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TABLE 1 


SHOWING CORRELATIONS BETWEEN POSITIVE
AND REVERSED SCALES OF DOGMATISM,
BALANCED (+ve AND −ve) DOGMATISM,
SOCIAL DESIRABILITY, AND LIE SCALE
(N = 127)

        SDB      D+        D−       DBᵃ       L
SDB    (.69)   −.234**   −.035    −.178*    +.397**
D+             (.74)      .174*    .801      .148
D−                       (.60)     .674**   −.136
DBᵃ                                          .005
L                                           (.63)

Note. Figures in brackets are reliabilities (Cronbach's alpha).
ᵃ DB = balanced dogmatism scale.
* p < .05.
** p < .01.


proximately 1 hour. D— was scored in 
reverse so that scores represent dogma- 
tism. 


RESULTS

When the data are analyzed for the 
group as a whole (Table 1) it is found
that: (a) the expected inconsistency of 
response (whether due to acquiescence or 
dissent) appears when D+ and D— are 
compared. This is attested by the low 
positive correlation between these scales. 
It accords with the results found by Pea- 
body (1961) using an independently con- 
structed reversed scale of dogmatism. (b) 
D— has no relationship with SDB, and its 
relationship with the L scale, although in 
the direction predicted by Rokeach, is not 
significant. (c) the relationship of D+
with SDB is negative and highly signifi-
cant, so that the subject answering in the
socially desirable direction tends to dis-
agree with the positive dogmatism item.
The relationship of D+ with the L scale 
is positive but not significant. 
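
The significance statements above can be checked with the usual t test for a correlation coefficient, t = r·sqrt(N − 2)/sqrt(1 − r²). The sketch below is an assumption about the test applied, since the note does not name it. The critical values are approximate two-tailed values for 125 degrees of freedom taken from standard tables, and the function names are hypothetical.

```python
import math

def t_for_r(r, n):
    """t statistic for testing a Pearson r against zero, with n - 2 df."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r**2)

# Approximate two-tailed critical values of t for 125 df (assumed from
# standard tables; valid only for N = 127, the sample size reported here).
T_CRIT_05 = 1.979
T_CRIT_01 = 2.616

def significance_label(r, n=127):
    t = abs(t_for_r(r, n))
    if t >= T_CRIT_01:
        return "p < .01"
    if t >= T_CRIT_05:
        return "p < .05"
    return "n.s."

# Correlations with SDB taken from Table 1: D+ (-.234) and D- (-.035).
print(significance_label(-0.234), significance_label(-0.035))
```

On these assumptions the D+ correlation with SDB exceeds the .01 criterion, while the D− correlation falls well short of the .05 criterion, in line with points (b) and (c).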

The subjects were then divided at the 
means of D+ and D— to form four 
groups, two being characterized by con-
sistent-to-content responses (but of high
and low value) and two by predominantly 
stylistic responses (acquiescers and dis- 
senters). Correlations between the rele- 
vant measures are shown separately for 
all four groups (Table 2). Only one of
these coefficients is significant: that be-
tween D+ and SDB in the low-dogmatic


TABLE 2 


SHOWING CORRELATIONS WITHIN HIGH-
ACQUIESCENT, LOW-ACQUIESCENT,
HIGH-DOGMATIC, AND LOW-
DOGMATIC GROUPS

High-acquiescent group (N = 34)
              D(+)     D(−)
SDB          −.178    −.082
Lie scale     .152    −.014

Low-acquiescent (dissenters) group (N = 27)
              D(+)     D(−)
SDB          −.291    −.052
Lie scale    −.359    −.239

High-dogmatic group (N = 32)
              D(+)     D(−)
SDB          −.308     .120
Lie scale     .153    −.130

Low-dogmatic group (N = 34)
              D(+)     D(−)
SDB          −.339*   −.091
Lie scale     .014     .189

* p < .05.


group, and this is in the negative direc- 
tion. Among the dissenting group the 
correlation between D+ and Lie scale 
approaches significance. This is also in 
the negative direction. Among the acqui- 
escent group, which is the specified group 
for Rokeach’s hypothesis, the correlations 
between D— and the measures of insin- 
cerity are negligibly different from zero. 
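
The division into four groups can be sketched as follows. This is one reading of the procedure, under two stated assumptions: that D− has already been reverse scored (so that high scores on both scales indicate dogmatism), and that scores falling exactly at the mean are counted as low. The function name and the toy scores are hypothetical.

```python
from statistics import mean

def classify(d_pos, d_neg_reversed):
    """d_pos: D+ scores; d_neg_reversed: D- scores after reverse scoring,
    so that high values on both scales indicate dogmatism."""
    m_pos, m_neg = mean(d_pos), mean(d_neg_reversed)
    labels = []
    for p, q in zip(d_pos, d_neg_reversed):
        high_p, high_q = p > m_pos, q > m_neg   # ties count as "low" (assumption)
        if high_p and high_q:
            labels.append("high-dogmatic")      # consistent to content, high
        elif not high_p and not high_q:
            labels.append("low-dogmatic")       # consistent to content, low
        elif high_p:
            labels.append("acquiescent")        # agrees with items and reversals
        else:
            labels.append("dissenting")         # disagrees with items and reversals
    return labels

# Toy scores (illustrative only).
print(classify([10, 10, 2, 2], [9, 1, 1, 9]))
```

A subject who agrees with both the original items and their reversals scores high on D+ but low on the reverse-scored D−, which is why that cell is labeled acquiescent.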


CONCLUSIONS


Clearly if any hypothesis is supported 
by these results it is Hypothesis 3(b). 
Insincerity is related to low scores on the 
Positive Dogmatism scale, the relationship 
being most pronounced among the low- 
dogmatic group and least among the 
acquiescent group. In view of the possi-
bility that social desirability scales meas-
ure only the most popular form of faking
(Martin & Stanley, 1963), this conclusion
may require qualification.

There is no evidence to support Ro-
keach’s Hypothesis B1.


REVERSED SCALE OF DOGMATISM


1. The United States and Russia have
many things in common. 

2. You cannot have a democracy in which 
one section of the community has most
of the power.

3. It is never necessary to restrict what 
people may say in public discussion 
of politics. 

4. When a man believes something he 
must make himself familiar with all 
the arguments against it. 

5. Man has within himself the power to 
control his destiny. 

6. The world is full of warm human 
relationships.

7. In general most people show consid- 
eration for others. 

8. I would resent assistance with my
personal problems.

9. Optimism about the future is man's 
natural state. 

10. What isn’t done today can equally well 
be done tomorrow. 

11. I never find it necessary to withdraw 
from a discussion no matter how 
heated. 

12. In discussions people generally don’t 
have much difficulty in understanding 
what I say. 

13. Even in a heated discussion I always 
pay careful attention to the arguments 
of others. 

14. It is not worth sacrificing your life 
to become a hero. 

15. I have never wished to be famous. 

16. Doing important work is not the main 
thing in life. 

17. Even if I were given all the breaks, I 
would not be able to do anything of 
great benefit to the world. 

18. A great proportion of the best thinkers 
have not had their achievements re- 
corded in history. 

19. I never take a dislike to people because 
of what they stand for. 

20. It’s possible to really live without be- 
lieving in any great cause. 

21. Life can be meaningful without devo- 
tion to ideals or causes. 

22. All of the philosophies which exist in 
this world have some truth in them 
and probably not one is totally correct. 

23. A person who is enthusiastic about 
many things is likely to develop 
greater character. 

24. It is always best to meet our political 
opponents half-way. 
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25. Differences of opinion in religion are 
more apparent than real. 

26. In these present days everyone should 
look to their own happiness. 

27. A person who attacks publicly the 
people who believe in the same things 
he does shows personal integrity. 

28. It is never necessary to be on guard 
against ideas no matter where they 
may originate. 

29. Many differences of opinion can sus- 


group. 
30. Truth is so elusive that no one can 


31. I like people to stick to their guns, 


s present that counts—the fu- 
ture will look after itself. 
38. If a man is to accomplish his mission 
in life he Fae! never take chances. 




40. Most people show a lot of foresight
when it comes to something which
affects them personally.
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PSYCHOLOGICAL REVIEW 


CRITICAL PERIODS, STIMULUS INPUT, AND 


EMOTIONAL REACTIVITY: 
A THEORY OF INFANTILE STIMULATION * 


VICTOR H. DENENBERG? 
Purdue University 


Experiments, using rats and mice, do not support the critical period 
hypothesis that there are certain limited time periods in infancy during 
which a particular class of stimuli will have profound effects upon 
subsequent behavior. Where findings are consistent with the hypothe- 
sis, further research has shown that the “critical period” is a complex 
function of amount of infantile stimulation. The central hypothesis of 
this paper is that amount of stimulus input in infancy acts to reduce 
emotional reactivity in a monotonic fashion. From this it follows that 
an inverted U function should be obtained between amount of infantile 
stimulation and adult performance for tasks involving some form of 
noxious element and which are of “moderate” difficulty. For tasks 
which are “easy” or “difficult,” the relationships between performance 
in adulthood and infantile stimulation should be monotonic, though 
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opposite in slope. Data supporting this theory are discussed. 


This paper has two purposes. The 
first is to question certain aspects of 
the critical period hypothesis (Scott, 
1958, 1962). The second, and more 
important, purpose is to develop a 
somewhat different hypothesis: that 
stimulation in infancy reduces the 
organism’s emotional reactivity; this 
reduction is a monotonic function of 
amount of stimulus input. In turn, 
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this hypothesized change in emotional 
reactivity offers a useful mechanism 
by which one can explain a number 
of disparate and seemingly unrelated 
changes in adult performance. 

The data to be considered, and the 
generalizations therefrom, come spe- 
cifically from research with rats and 
mice which have received various forms 
of stimulation between birth and wean- 
ing. For more general reviews of early 
experience research and the critical 
period hypothesis see Denenberg 
(1962b) and Scott (1962). 


CRITICAL PERIODS 


Recently Scott (1962) has suggested 
that there are three major kinds of 
critical period phenomena: one for the 
formation of basic social relationships, 
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a second concerned with optimal 
periods for learning, and a third involv- 
ing infantile stimulation. This discus- 
sion is concerned only with the last of 
these. 

There are at least two different ways 
in which the critical period hypothesis 
has been interpreted. One is that the 
same physical stimulation at different 
ages has different effects upon S; this 
says, simply, that S’s age is an im- 
portant parameter and can scarcely be 
questioned. The second interpretation 
is that there are certain limited time 
periods in development during which 
a particular class of stimuli will have 
particularly profound effects and that 
the same stimulation before or after 
this interval will have little, if any, 
effect upon the developing organism. 
This approach stems directly from 
the embryological meaning of critical 
periods, and it is in this context that 
Scott (1958, 1962; Scott & Marston, 
1950; Williams & Scott, 1953) has de- 
veloped and described the critical 
period hypothesis. The term, critical 
periods, when used in this paper, will 
refer to this second interpretation. 

Operationally, a test of the critical 
period hypothesis requires several ex- 
perimental groups which receive the 
same stimulation at different ages and a 
control group which does not receive 
the stimulation at all. If some of the 
experimental groups do not differ from 
the control while others do, this may 
be construed as evidence supporting 
the critical period hypothesis. (On the 
other hand, any significant differences 
among the experimental groups are 
support for the first meaning of the 
hypothesis. ) 
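
The decision logic of this design can be written out as a short sketch. The function name, argument names, and result labels are hypothetical. The code simply encodes the two inferences described in the paragraph above and assumes that the significance tests have already been carried out.

```python
def interpret(differs_from_control, groups_differ):
    """differs_from_control: dict mapping age-of-stimulation label -> bool
    (True if that experimental group differs significantly from the control).
    groups_differ: True if the experimental groups differ among themselves."""
    findings = []
    flags = list(differs_from_control.values())
    if any(flags) and not all(flags):
        # Some ages are effective and others are not: second interpretation.
        findings.append("consistent with delimited critical periods")
    if groups_differ:
        # Same stimulation, different effects by age: first interpretation.
        findings.append("age of stimulation matters")
    return findings or ["no age effect detected"]

# Hypothetical outcome: only the later stimulation differs from control.
print(interpret({"Days 1-5": False, "Days 6-10": True}, groups_differ=True))
```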

Although it is obvious that the age 
at which S is stimulated is an impor- 
tant parameter, when one carefully 

examines experiments investigating 
critical periods in infancy in rats and 
mice, there is little evidence support- 


ing the hypothesis that stimulation 
must occur only during certain de- 
limited time intervals to affect the 
organism’s subsequent behavior. In- 
terestingly, the only unequivocal dem- 
onstrations of critical periods are to be 
found in papers which used physio- 
logical, rather than behavioral, end- 
points (Bell, Reisner, & Linn, 1961; 
Levine & Lewis, 1959a). 

The general conclusion noted above 
might not be apparent from a casual 
inspection of the literature. There are 
several studies which, if taken in iso- 
lation, appear to support the hypothe- 
sis. For example, two early experi- 
ments on critical periods (Denenberg, 
1958, 1960) seemed to support the 
concept. In the first study mice ini- 
tially conditioned between 20 and 40 
days of age had essentially the same 
reconditioning scores at 50 days, and 
all these scores were significantly 
higher than either a 50-day control 
group or a group which was initially 
conditioned at 16 days. These data 
suggested a critical period around 20 
days of age such that conditioning ex- 
perience prior to that date did not 
affect subsequent reconditioning per- 
formance; experience after that date 
appeared to have the same effect upon 
reconditioning, regardless of the origi- 
nal age of conditioning or the actual 
amount of conditioning. A subsequent 
study with older Ss (Denenberg, 1960) 
established that it was indeed the age 
of the animals rather than the time 
interval between conditioning and re- 
conditioning which mediated the ef- 
fects.3


A number of investigators have also 
shown, with the rat, that stimulation in 
infancy has different consequences as com- 
pared to equivalent stimulation in adulthood. 
(These include handling in infancy and early 
adulthood, Levine, 1956; raising rats in 
groups of 6 or 12 during infancy or adult- 
hood, Seitz, 1954; subjecting rats to handling 
and auditory stimulation in infancy and 
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However, another study with the 
mouse, though not directly comparable 
to the two above, disclosed some com- 

plexities concerning critical period 
phenomena which cast doubt on the 
degree of generality of the hypothesis. 
Denenberg and Bell (1960) studied 
avoidance learning in adulthood as a 
joint function of age of infantile stimu- 
lation (i.e., critical periods), intensity
of stimulation in infancy, and intensity
of electric shock during adult perform-
ance. These latter two parameters had
been held constant in prior research. 
Analyses of the learning data revealed 
that critical periods were not significant 
as a main effect; but the Periods ×
Adult Shock and the Periods × Adult
Shock × Infantile Stimulation interac-
tions were significant. The disturbing
thing was that the age variable (i.e.,
critical periods) was dependent upon
the particular level of stimulation used
in the adult learning task. By the
selection of certain combinations of in-
fantile stimulation levels and adult
shock levels one could “prove” that
stimulation during certain “critical
periods” could facilitate, have no effect,
or interfere with avoidance learning in
adulthood. The phenomena, then, ap-
pear to be far more complex than pre-
viously recognized in the generaliza-
tions afforded by the critical period
hypothesis.

These data suggest that any single
“critical period” is, at least in part,
dependent upon the parameter of stim-
ulus intensity. Thus, Denenberg’s
prior studies (1958, 1960) should be
looked upon with caution since differ-
ent shock levels may have yielded very
different findings.
Critical period research with the rat 
generally used the procedure called 



adulthood, Spence & Maher, 1962a; and
temporarily blinding or deafening rats in in-
fancy or later life, Wolf, 1943.)
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handling rather than shock. This 
consists of removing the pups from 
the home cage, and placing the young 
into containers (e.g., cans containing
wood shavings, small wooden compart- 
ments, grid-floored boxes). The pups 
are left there for a short duration 
(generally 2 to 3 minutes though some 
Es have used intervals of 8 minutes or 
longer) and are then returned to the 
home cage. This is generally done 
once a day for different prescribed 
periods. 

In a series of experiments Denen- 
berg and Karas (1960, 1961) found 
that rats handled for the first 10 days 
of life were superior with respect to 
body weight, avoidance learning, and 
survival capability to ones handled 
during the second 10 days. In general, 
both 10-day groups were superior to 
Ss which had been handled for 20 days 
in infancy. 

These data suggested that it might 
be possible to handle Ss for fewer than 
10 days and possibly isolate specific 
critical periods related to different de- 
pendent variables. Therefore Denen- 
berg (1962a) handled different groups 
of rats for 3- or 5-day intervals 
at different ages during the first 10 
days of life. Measures of body weight, 
avoidance learning behavior, and sur- 
vival time, with one exception, did not 
support the critical period hypothesis. 
The exception was that Ss handled for 
the first 5 days of life were not sig- 
nificantly better than controls in avoid- 
ance learning while Ss handled on Days 
6-10 were superior to controls and 
equal to Ss handled on Days 1-10. 

Though the avoidance learning find- 
ings are compatible with the critical 
period hypothesis, another possibility, 
based upon the Denenberg and Bell 
mouse study, is that the failure to
modify avoidance learning by handling 
during the first 5 days was because the 
stimulation induced by handling was 
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not sufficiently intense to affect S. 
Denenberg and Kline (1964) tested 
this hypothesis by giving different 
groups of rats electric shock (0.2 milli- 
ampere) on Days 1-5, 1-3, 3-5, 2, or 
4. The shocked groups (with the ex- 
ception of the group shocked only on 
Day 2) were significantly better in 
avoidance learning than a nondisturbed 
control group. 

In conclusion, experimental research 
with the mouse and rat investigating
critical periods
has established that the critical period
hypothesis is not sufficient to account 
for the findings. In instances where 
the data are consistent with the hypoth-
esis further study has found that the
“critical period” is a complex function 
of the parameter of stimulus intensity. 
Research to date indicates that, for the 
rat and mouse at least, there may be as 
many “critical periods” as there are 
combinations of independent variable 
parameters and dependent variable 
measures (Denenberg, 1962a). Mey- 
ers (1962) and Lindholm (1962) have 
recently arrived at a similar conclusion. 

Then should the term “critical 
periods” be abandoned when discussing 
the effects of infantile stimulation? 
Not at the present time, certainly. The
concept has had, and will continue to
have, great heuristic value. However,
conclusions concerning critical periods
must be limited to the ex-
perimental operations involved until
enough empirical evidence has been 
obtained to indicate the limits of gen- 
eralization. In terms of current re- 
search strategy it appears more reason- 
able to study the functional relationships 
among various classes of independent 
and dependent variables between birth 
and weaning than to design experi- 
ments which try to isolate critical 
periods during this crucial stage of 
development. 
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THEORY OF INFANTILE STIMULATION 
AND ADULT PERFORMANCE 


One approach to the problem of
understanding the effects of stimula-
tion in infancy is by means of the
critical period hypothesis. A different
approach is to examine the diverse
research findings to see if any common
factor can be discerned. Such an
examination of the literature leads to
the conclusion that stimulation admin-
istered between birth and weaning
brings about a reduction in “emotional
reactivity.” Furthermore, the greater
the stimulation in infancy, the less emo-
tional S will be in adulthood. The
central hypothesis of this paper is that
emotional reactivity is reduced as a
monotonic function of amount of stimu-
lus input in infancy.

Before discussing this hypothesis in
detail, and showing some of its conse-
quences, it is first necessary to define
“emotional reactivity” and show that
this intervening variable is modified by
stimulation in infancy. This will be
done by showing, on the stimulus side,
that different methods of stimulating S
in infancy have similar consequences in
adulthood; and, on the response side,
by showing that different operational
measures of emotional reactivity yield
internally consistent results (Miller,
1959). The two most common meth-
ods of stimulating Ss in infancy have
been handling and shocking. Emo-
tional reactivity has been measured by
procedures as divergent as open-field
testing, consummatory behavior, be-
havior in a learning situation, and
behavior when given an opportunity
to emerge from the home cage.


Handling and Emotional Reactivity


Open-field Behavior. In two experi-
ments Denenberg and Morton (19648)
found that rats handled daily between
birth and weaning were significantly


controls, 
tained by Denenberg and 
(1963). Rats handled for the first 10 
or 20 days of life were found to be 
significantly more active and to defe- 
cate significantly less than nonhandled 
controls (Denenberg, Morton, Kline, 
& Grota, 1962). In a study relating 
infantile stimulation to age of testing 
Denenberg and Smith (1963) handled 
or shocked rats during the second 10 
days of life and tested independent
groups at 50, 100, 150, or 200 days of 
age. Both groups which received 
stimulation in infancy were signifi-
cantly more active than the controls
and had a significantly lower defecation 
percentage. The open-field test was 
also used by Schaefer (1963), but his 
index of emotionality was the amount 
of time spent crouching during the 2 
minutes after presentation of a sharp 
click. Handled Ss crouched signifi- 
cantly less than nonhandled controls. 
Consummatory Behavior. Levine 
(1957, 1958) handled, shocked, or did 
not disturb rats between birth and 
weaning. When adult, the Ss were 
deprived of water for 18 hours and 
then given the opportunity to drink. 
Levine (1957), 
hypothesized that since deprivation consti- 
tutes a novel internal stimulus complex for 
nonhandled Ss, the novelty should result 
in greater emotional disturbance and produce 
reduced water consumption following a 
period of deprivation [p. 609].
In both experiments handled and 
shocked Ss consumed significantly 
more water than nondisturbed controls. 
In a similar type of experiment Spence 
and Maher (1962a) also found that 
handled Ss consumed significantly 
more water than controls. 
Emergence-from-Cage Behavior. 
Hunt and Otis (1963) exposed their 
experimental rats to a variety of stimu- 
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were significantly 
defecated significantly less than non- 
disturbed controls when tested at 50, 


0.8 milliampere shock on Day 4 of life 
were significantly more active than Ss 
receiving the same stimulation on Day 
2; the latter group did not differ from 
nondisturbed controls (Kline & Denen- 
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berg, unpublished). The effect of 
electric shock is measurable at weaning 
as well. Rats given 30 seconds of 0.2 
milliampere current on Days 1 and 2 
of life were significantly less emotional 
than nondisturbed controls at the time 
of weaning as measured by time to 
emerge into the open field and activity 
in the field (Denenberg, Carlson, & 
Stephens, 1962). 

Consummatory Behavior. Rat pups 
were given 3 minutes of electrical stim- 
ulation (variable current from 0.10 
milliampere to 0.37 milliampere, dis- 
regarding S’s resistance) from Day 1 
through Day 20. These Ss consumed 
significantly more water in adulthood 
than nondisturbed controls; the find- 
ings were replicated in a second experi- 
ment (Levine, 1957, 1958). A similar 
finding was obtained by Lindholm 
(1962) who shocked rats for the first 
10 days, the second 10 days, the first 
20 days of life, and did not disturb 
a control group. No specified shock 
level was used; instead, the shock was 
set high enough to make the Ss move 
and squeal in the apparatus. The in- 
tensity of the shock was increased as 
S matured. When adult, the Ss were 
deprived of water for 24 hours and 
then given an opportunity to drink. 
The Ss shocked in infancy took signi- 
ficantly less time to initiate drinking 
and consumed significantly more water 
than the nondisturbed group. A subse- 
quent analysis determined that the dif- 
ferences in consummatory behavior 
could all be attributed to the latency to 
approach the water tubes. 


Stimulus Input and Emotional Reactiv- 
ity: The Monotonicity Hypothesis 
The data summarized above are 
sufficient to justify the conclusion that 
stimulation in infancy will result in a 
reduction in emotional reactivity in 
adulthood. It remains to be demon- 
strated that a monotonic relationship 
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obtains between amount of stimulus in- 
put in infancy and emotional reactivity. 
Two experiments have explicitly tested 
this hypothesis (Denenberg, Carlson, 
& Stephens, 1962; Denenberg & Smith, 
1963). In addition, there are three 
other experiments in the literature 
which offer support for the hypothesis 
(Levine, 1957, 1958; Lindholm, 1962). 

Amount of stimulus input was varied 
by Denenberg and his co-workers by 
handling rats for 0 (control), 10, or 20 
days in infancy. In adult open-field 
testing the controls had the highest 
defecation rate while the group handled 
for 20 days had the lowest rate; all 
differences were significant. In the 
second study (Denenberg & Smith, 
1963) one group of rats received 3 
minutes of shock daily from Day 11
through Day 20, a second group was
placed on the unelectrified grid (han-
dled group), and a third group was
not disturbed. Figure 1 shows the 
percentage of Ss in each group defe- 
cating before and after adult avoid- 
ance learning training. The greater 
the stimulus input in infancy, the 
smaller (significantly) was the percent-
age of defecators in adulthood. Fur-
thermore, the shocked Ss were the only 
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Fic. 1. Percentage of rats defecating i 
before and after avoidance learning (from
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Denenberg & Smith, 1963). 
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ones whose defecation was 
significantly reduced following avoid- 
ance learning training.

Levine (1957, 1958) compared wa- 
ter consumption following 18 hours of 
thirst for Ss which had been shocked, 
handled, or not disturbed in infancy. 
In both experiments the shocked Ss 
consumed the most water, followed by 
the handled group, with the controls 
consuming the least. The difference 
between shocked and handled Ss ap- 
proached significance in the first study 
(p < .10) and was significant in the 
second study; nonhandled controls 
consumed significantly less than either 
experimental group in both studies. 

In the Lindholm (1962) experiment 
rats were shocked for the first 10 days 
of life, the second 10 days, or the first 
20 days, while a control group was not 
disturbed (his postweaning experi- 
mental group is not pertinent to this 
discussion). Consummatory behavior 
following 24 hours of thirst found that 
the controls had the greatest latency 
and consumed the least amount of wa- 
ter, followed by the two 10-day groups, 
with the 20-day group having the short-
est latency and ingesting the greatest
amount of water. 

To summarize: Stimulus input in 
infancy has been varied by number of 
days of handling (0, 10, 20) (Denen- 
berg et al., 1962), number of days of 
shock (0, 10, 20) (Lindholm, 1962), 
and form of stimulation (nondisturbed 
controls, handled, shocked) (Denen- 
berg & Smith, 1963; Levine, 1957, 
1958). Emotional reactivity has been 
measured by open-field behavior and 
consummatory behavior. In each in- 
stance the greater the amount of stimu- 
lus input in infancy the less was the 
level of S's emotional reactivity in 
adulthood. Figure 2 presents an ideal- 
ized curve showing this theoretical 
relationship. The negatively accelerated
form of the curve is suggested by
the findings of the five experiments
cited above. In each instance the
greatest change in behavior occurred
between the control group and the
group receiving the intermediate
amount of stimulation.

Fig. 2. Theoretical curve relating stimulus input in infancy to emotionality in
adulthood.
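
As a rough illustration of the negatively accelerated relationship described for Figure 2, the sketch below assumes an exponential-decay form. The function name, the ceiling and floor values, and the rate constant are all hypothetical, chosen only to show the shape of such a curve; they are not fitted to any of the data cited above.

```python
import math

def emotionality(stimulus_input, ceiling=1.0, floor=0.2, rate=0.15):
    """Hypothetical negatively accelerated curve: emotionality falls toward
    a floor as infantile stimulus input rises. All constants are assumed."""
    return floor + (ceiling - floor) * math.exp(-rate * stimulus_input)

# Illustrative input levels (for example, 0, 10, or 20 days of stimulation).
levels = [0, 10, 20]
values = [emotionality(x) for x in levels]

# Change from control to intermediate group, and from intermediate to high.
first_drop = values[0] - values[1]
second_drop = values[1] - values[2]

for x, e in zip(levels, values):
    print(f"input={x:2d}  emotionality={e:.3f}")
print(f"first drop={first_drop:.3f}, second drop={second_drop:.3f}")
```

Under these assumed constants the larger change falls between the control and intermediate levels, which is the pattern the text describes; any monotonically decreasing, negatively accelerated function would show the same qualitative result.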


Relationship between Emotional 
Reactivity and Adult Performance 

When one examines the dependent 
variables used in studies of infantile 
stimulation, it is apparent that many 
of them contain some form of noxious 
element (avoidance learning, under- 
water swimming, thirst, starvation). 
Obviously, then, S’s emotionality or 
level of arousal will have a significant 
effect upon his performance in such 
tasks. It is reasonable to expect that 
there will be an optimal level of emo- 
tionality for efficient performance. As 
one moves away from this optimal 
level, performance should drop off, thus 
resulting in an inverted U function 
(cf. Hebb, 1955). 

However, another parameter is 
needed. The Yerkes-Dodson Law 
(Broadhurst, 1957) posits that the op- 
timal level of motivation for a task 
decreases as task difficulty increases. 
Assuming that the more emotional S is 
more motivated, it follows that highly 
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emotional Ss should have the best 
performance when the level of task 
difficulty is quite low while the least 
emotional Ss should be the best per- 
formers when the task is very difficult 
(Broadhurst, 1957; Karas & Denen- 
berg, 1961). In both of these instances 
the relationship between performance 
and emotionality should be monotonic, 
though opposite in slope. It is only
when a task is of “moderate” difficulty 
that one should expect to obtain the 
nonmonotonic inverted U function. 
Figure 3 presents the theoretical rela- 
tionship between adult performance 
and S’s emotionality for tasks of different
levels of difficulty.⁴
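
The three cases described above (easy, moderate, and difficult tasks) can be sketched numerically. The example below is only an illustration under stated assumptions: performance is modeled as a Gaussian-shaped inverted U around an optimal emotionality level, and that optimum is assumed to fall linearly as task difficulty rises. The function names, the 0-to-1 scales, and the width parameter are hypothetical and do not come from the studies cited.

```python
import math

def optimal_emotionality(difficulty):
    """Assumed linear rule: the optimum falls as difficulty (0..1) rises."""
    return 1.0 - difficulty

def performance(emotionality, difficulty, width=0.5):
    """Hypothetical inverted U (Gaussian) centered on the assumed optimum."""
    optimum = optimal_emotionality(difficulty)
    return math.exp(-((emotionality - optimum) ** 2) / (2 * width ** 2))

# Emotionality levels observed within a study, from low to high (assumed scale).
levels = [0.0, 0.25, 0.5, 0.75, 1.0]

def curve(difficulty):
    """Performance at each emotionality level for one task difficulty."""
    return [performance(e, difficulty) for e in levels]

easy = curve(0.0)      # optimum at the highest emotionality level
moderate = curve(0.5)  # optimum in the middle of the range
hard = curve(1.0)      # optimum at the lowest emotionality level

for name, values in [("easy", easy), ("moderate", moderate), ("hard", hard)]:
    print(name, [round(v, 3) for v in values])
```

Within the sampled range, the easy and difficult tasks each yield a monotonic curve with opposite slopes, and only the moderate task yields the nonmonotonic inverted U, because only there does the assumed optimum fall inside the range of emotionality levels.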


Some Relationships between Stimulus 
Input in Infancy and Adult 
Performance 


A number of experiments have been 
carried out, the results of which are 
consistent with the predictions made by 
the theory depicted in Figures 2 and 3. 
For example, Karas and Denenberg 
(1961) explicitly tested the theory by 
assuming that the greater the number 
of days of handling in infancy, the 
greater the reduction in emotional re- 
activity in adulthood. In addition, 
they assumed that “spaced” handling 
would result in greater emotional re- 
duction than “massed” handling. Ex- 
perimental rats were space or mass 
handled for 10 or 20 days while con- 
trols were not disturbed. The cri- 
terion task was an underwater dis- 
crimination Y maze similar to the one 
used by Broadhurst (1957), who had
shown that maze performance was related
to S’s level of emotionality. The
discrimination task was an “easy” one
(86.6% correct choices, highly comparable
to Broadhurst’s value of 86.4%
errorless trials on his “easy” task), thus
leading to the deduction that the most
emotional Ss should exhibit the best
performance with a monotonic decline
in performance as emotional reactivity
decreased (see Figure 3). Analysis
of the swimming time scores found that
the rank order of the five groups was
as predicted, thus confirming the
hypotheses.

4 Still another parameter which may be
relevant is rate of development. Denenberg
and Karas (1959), in comparing the effects
of handling upon the rat and mouse, point
out that the mouse is a more rapidly developing
organism and they suggest the hypothesis
that the more rapid an organism’s development,
the greater the effect of infantile
experience.

Fig. 3. Theoretical relationship between
performance in adulthood and emotionality
level for tasks of varying degrees of difficulty.

Several studies have found an in- 
verted U relationship between amount 
of stimulation in infancy and later per- 
formance. In the experiments by 
Denenberg (1962a) and Denenberg
and Karas (1960, 1961) rat pups 
were handled for 0 (controls), 3, 5, 
10, or 20 days in infancy. One can 
average the data for the 3-day, 5-day, 
and 10-day groups (irrespective of age 
of stimulation), add in the findings of 
the control and 20-day groups, and plot 
the functional relationship between
number of days of handling in infancy 
and later performance. The averaging
procedure acts to partially balance out
the age of stimulation. One can, in
other words, disregard “critical
periods” between birth and weaning
and examine amount of stimulation
within that period. Such an analysis
is shown in Figure 4 for 21- and 69-day
body weight, avoidance learning,
and survival time (Denenberg, 1962a).
The general function of these data may
be described by an inverted U curve.

Fig. 5. Number of avoidance learning responses in rats as a function of infantile shock
intensity on Day 2 or Day 4 (from Kline & Denenberg, 1964).


Both too much and too little handling 
led to less than optimal performance. 
The optimal amount of handling
varied with different dependent variables.
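
The averaging procedure described above can be sketched as follows. The scores are invented placeholders, not values from the cited experiments, and the dictionary layout and variable names are assumptions made only for illustration: each handling duration maps to the means of the groups that received that many days of handling at different ages.

```python
from statistics import mean

# Hypothetical group means keyed by number of days of handling in infancy.
# Durations of 3, 5, and 10 days were given at several different ages, so
# each has several groups; the numbers are placeholders, not real data.
group_means = {
    0: [10.0],              # nondisturbed controls
    3: [13.0, 15.0, 14.0],  # 3 days of handling at different ages
    5: [17.0, 15.0, 16.0],  # 5 days of handling at different ages
    10: [18.0, 20.0, 19.0], # 10 days of handling at different ages
    20: [12.0],             # handled throughout the first 20 days
}

# Average over age of stimulation so that only amount of handling remains.
averaged = {days: mean(vals) for days, vals in sorted(group_means.items())}

# The duration with the best averaged performance under these placeholders.
best_days = max(averaged, key=averaged.get)

for days, score in averaged.items():
    print(f"{days:2d} days of handling: mean score {score:.1f}")
print("best duration:", best_days)
```

With these placeholder values the averaged scores rise and then fall across durations, which is the kind of inverted U the text reports; real data would of course determine where the optimum lies for each dependent variable.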

This same general function was ob- 
tained when electric shock was used to 
stimulate Ss in infancy. Rats were 
shocked on the second or fourth day 
of life using 0.2 milliampere, 0.5 milliampere,
or 0.8 milliampere electricity;
one group of controls was handled,
another not disturbed (Denenberg &
Kline, 1964; Kline & Denenberg,
1964). The Ss were given avoidance
learning training between 60 and
69 days of age. Figure 5 presents 
the relationship between intensity of 
stimulation in infancy and adult learn- 
ing. The inverted U function is clearly 
present. 

Evidence that this inverted U func- 
tion has some degree of interspecies 
generality is seen in Figures 6 and 7, 
which show the performance of mice. 
Figure 6 summarizes an experiment by 
Denenberg (1959) in which Ss were
given classical buzzer-shock conditioning
at 25 days under 0.2 milliampere,
0.5 milliampere, or 0.8 milliampere
shock. At 50 days they were taken to
an extinction criterion, split into thirds,
and reconditioned under 0.2, 0.5, or
0.8 milliampere shock. The adult reconditioning
data are plotted as a function
of shock level at 25 days. An
inverted U function adequately describes
all three curves.

Bell and Denenberg (1963) gave
mice 0.1 milliampere, 0.3 milliampere,
or 0.5 milliampere shock in infancy,
handled other groups (0.0 milliampere),
and did not disturb still others. The
avoidance learning performance, plotted
in Figure 7, again reveals the inverted
U function.

The relationship between stimulus
input in infancy and adult performance
aids in interpreting other data. For
example, Levine, Chevalier, and Korchin
(1956) found that rats handled
for the first 20 days of life performed
best in an adult avoidance learning
task, followed by Ss which had been
handled and shocked, with nonhandled
controls the poorest learners. These
data exhibit a rough inverted U function
with the group receiving the intermediate
amount of stimulation exhibiting
the best performance.

Fig. 6. 50-day conditioning scores in mice for three shock levels as a function of
infantile shock intensity (from Denenberg, 1959).

The theory depicted in Figures 2 and 
3 leads to the prediction that Ss with 
the greatest amount of stimulus input 
in infancy should have the best per- 
formance when the task is quite diffi- 
cult. There are, as yet, no data rele- 
vant to this prediction. 

It should be possible to generalize
the concept of “task difficulty” beyond
the usual learning context. For
example, the inverted U curves for
body weight and survival time in Fig- 
ure 4 were obtained after Ss had under- 
gone avoidance learning training. For 
Ss which did not receive avoidance 
training, the controls lived longest, 
followed by Ss handled for 10 days, 
with the 20-day group dying earliest.
In other words, those data parallel
the “easy” curve in Figure 3 while
the results of the groups which had
avoidance training follow along the
curve of the “moderate” group. Since
avoidance learning has been shown to
be a stressor (Brady, Porter, Conrad,
& Mason, 1958; Denenberg & Karas,
1961), this suggests the hypothesis that
stress experience, prior to a terminal
stress, may act in a manner similar to
the task difficulty parameter. This
leads to the interesting prediction that
Ss subjected to a severe, but nonlethal
stress, would be best able to survive a
subsequent lethal stress, if they had
received a considerable amount of stimulation
in infancy.

Fig. 7. Number of avoidance responses in mice as a function of stimulation in infancy.
(C: nonstimulated controls; 0: handled, nonshocked; 0.1, 0.3, 0.5: milliamperes of shock;
from Bell & Denenberg, 1963.)

It is pertinent to ask here about the 
relationship between stimulation in in- 
fancy and performance on a task which 
minimizes the emotional component. 
One would expect little or no relation- 
ship between infantile stimulation and 
later behavior. This is exactly what 
Denenberg and Morton (1962b) found 
in a series of three experiments. Han- 
dling in infancy did not have any effect 
upon problem-solving behavior as 
measured by the Hebb-Williams maze 
test. The authors concluded that, 
These findings suggest that preweaning stim- 
ulation such as shock and handling affects 
emotional processes but does not have any 
direct effect upon perceptual or problem- 
solving behavior [p. 1098]. 

Schaefer (1963) has obtained similar 
results. 

Similarly, Spence and Maher 
(1962b) suggested that the obtained 
differences in learning performance 
between Ss stimulated in infancy and 
controls may be a function of emotional 
rather than learning factors. Their 
experiment on this point, while equivo- 
cal, does offer some support for this 
position. 


Comparisons with Bovard’s Theory 


Though Bovard’s (1958) theory is 
concerned with the effects of handling 
upon viability, there are certain points 
of agreement and disagreement to be
noted between his position and the one
presented here. The major point of 
disagreement is that Bovard combines 
the results of preweaning and post- 
weaning handling (the term “gentling” 
has been commonly used to describe the 
procedure of manually manipulating 
rodents postweaning), while this 
theory has been limited to preweaning 
stimulation. There are logical and empirical
reasons for making such a distinction.
The postweaning rat is a
very different organism, behaviorally 
and biologically, from the preweaning 
animal, and it is not logical to assume 
that stimulation administered to the 
immature, recently born rat has the 
same consequences as the same stimula- 
tion administered to a weanling. The 
experiments cited in Footnote 3 offer 
general support for this conclusion. 
Clear evidence that handling and elec- 
tric shock after weaning do not affect 
rats in the same manner as handling 
and shock before weaning can be found 
in the monograph by Brookshire, Litt- 
man, and Stewart (1961). 

A paper directly relevant to this 
issue is the one by Levine and Otis 
(1958). They either handled or gentled
rats before or after weaning; a
control group was not disturbed. Pre- 
weaning handling or gentling had the 
same effect of significantly increasing 
body weight and survival rate while 
postweaning handling or gentling had 
no significant effect, compared to the 
control group. In addition to showing 
that preweaning stimulation has differ- 
ent consequences from postweaning 
stimulation, the Levine and Otis 
(1958) paper failed to substantiate 
Weininger’s (1953, 1956) results 
which Bovard used as part of the em- 
pirical base for his theory. Ader 
(1959) and J. H. Scott (1955) have 
also failed to replicate Weininger’s
findings. Even the Hammett (1922) 
reference cited by Bovard is not sup- 
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portive of his position, since the ex- 
perimental data reported in that paper 
was not concerned with the effects of 
gentling rats. Hammett does refer 
back to his 1921 paper, and it is that 
portion of Hammett's paper which 
Bovard refers to in his reference to 
Hammett. Several other researchers 
have also cited the Hammett (1921) 
paper when discussing the positive ef- 
fects of postweaning gentling. This 
is most interesting, since Hammett 
presents absolutely no data supporting 
such a conclusion. Hammett (1921) 
carried out five experiments concern- 
ing the effects of parathyroidectomy or 
thyroidectomy upon survival. The 
first three studies do not permit a valid 
evaluation of the effects of gentling be- 
cause of lack of appropriate controls 
(though the experiments were valid 
for Hammett’s purposes). The fourth 
and fifth experiments (see Hammett, 
1921, p. 201) can be used to evalu- 
ate the effects of gentling since Ham- 
mett shifted the rats at weaning from 
one colony room to the other, thus 
balancing environmental conditions for 
the two different stocks of animals. 
The only significant effect in the fourth 
experiment was that the two colony 
stocks differed in genetic make-up; 
gentling failed to have any effect. And 
the fifth experiment contradicts this: 
all rats, regardless of genetic stock or 
whether or not they were gentled, sur- 
vived the parathyroidectomy operation. 
One can only conclude that those who 
have cited Hammett’s work as proof 
that gentling enhances survival ability 
have allowed their fertile imagination 
to inhibit their reading comprehension. 
All these experiments suggest that the 
postweaning handling (or gentling) 
phenomenon is rather tenuous with 
respect to modification of the rat’s 
physiology. 

There is some degree of agreement 
between Bovard’s theory and the pres- 
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ent one concerning preweaning stim- 
ulation. In so far as behavioral indices 
of emotional reactivity correlate with 
what Bovard (1958) calls stress, then 
the two theories are in agreement that 
“early [preweaning] handling raises 
the threshold for response to stress 
[p. 259].” Bovard, however, does not
believe that handling is stressful while 
Levine (1956) and Denenberg (1959) 
have taken the opposite position. Bovard
believes that “early stress lowers
resistance to later stress [p. 260].” 
Since electric shock—a stimulus typi- 
cally considered to be stressful—and 
handling have similar functional prop- 
erties vis à vis emotional reactivity, the 
Levine and Denenberg position appears 
to be on firmer ground. Further evi- 
dence that stressful stimulation in in- 
fancy can have beneficial results, rather 
than the deleterious results predicted 
by Bovard, is discussed in the next 
section.


Discussion


The hypothesis that emotionality de- 
creases as a monotonic function of 
stimulus input may not appear reason- 
able at first glance because of our 
knowledge of conditioned fears, trau- 
matic experiences in childhood, etc. 
It must be emphasized that this hy- 
pothesis is based upon, and is restricted 
to, data obtained on rodents which 
have been stimulated prior to weaning.
Weaning has been used as a criterion 
point, not because of any clinical 
implications concerning mother-young 
separation, but because all of S’s senses 
are functioning by this time and be- 
cause it is only after weaning that there 
is any evidence of long-term retention 
of a learned fear response (Campbell & 
Campbell, 1962; Denenberg, 1958, 
1959; Lindholm, 1962). Furthermore, 
the consequences of handling or elec- 
tric shock after weaning have been 
shown to be very different from their 
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effects prior to weaning (Brookshire, 
Littman, & Stewart, 1961). 

Scott (1962) has made a similar 
suggestion and has indicated that the 
transition point may occur at about 16 
days of age, when the eyes open. Up 
to that age the critical factor mediating 
infantile stimulation is hypothesized by 
Scott to be the adrenal cortical stress 
mechanism while the psychological 
process of reduction of fear through 
familiarity is presumed to predominate 
between 17 days when the eyes open 
and 30 days. 

The traditional assumption that 
traumatic experiences in infancy must 
inevitably have deleterious effects in 
later life is clearly not consonant with 
many of the studies cited above. Sev- 
eral other experiments, in fact, may 
also be noted which contradict this 
classical assumption. Baron, Brook- 
shire, and Littman (1957) found that 
rats given 1.25 milliamperes of electric 
current continuously for 3 minutes on 
Days 20 and 21 were better escape and 
avoidance learners in adulthood than 
controls. Infant rats either placed on 
a laboratory shaker which oscillated 
180 times per minute or shocked with 
0.1 milliampere electricity were found 
to exhibit earlier maturation of the 
adrenal ascorbic acid depletion re- 
sponse to stress (Levine & Lewis, 
1959b). Using the same dependent 
variable Schaefer, Weingarten, and 
Towne (1962) showed that placing rat 
pups into a refrigerator at 7° to 10° 
Centigrade also resulted in earlier 
maturation of the adrenal responses. 
Finally, Werboff and Havlena (1963) 
induced febrile convulsions in 3-day 
old rats by means of microwave dia- 
thermy. The Ss so treated weighed 
significantly more at weaning and were 
significantly more resistant to audio- 
genic seizures. 

Certain parallels may be noted be- 
tween this theory and the arousal or 
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activation theories of Hebb (1955), 
Duffy (1957), and Malmo (1959). 
All the theories are concerned with 
the intensity dimension of stimulation 
or arousal, and the inverted U function 
relating the intensity dimension to 
performance is common to all. The 
theory proposed here contributes to 
general arousal theory in two ways. 
First, intensity of infantile stimulation 
is specifically implicated as a major 
parameter affecting later differences in 
“chronic” or general level of arousal. 
Brookshire et al. (1961) have isolated 
what may be a similar phenomenon 
with the postweaning rat. They sug- 
gest that their “pure shock” residual 
factor of shock trauma may act to 
modify S’s arousal level. The second 
contribution is a methodological one: 
the techniques used in infantile stimu- 
lation are also procedures for experi- 
mentally generating individual differ- 
ences in chronic arousal level. This 
opens up a new avenue of attack for 
those interested in individual differ- 
ences in arousal level. 

Though the proposed theory does 
account for a considerable number of 
experimental findings, contrary data 
are to be noted. For example, Spence 
and Maher (1962a) subjected rats to 
intense auditory stimulation between 
birth and weaning, handled a second
group but did not give them auditory
stimulation, moved the cages of a third 
group, and did not disturb a fourth 
group. Using water consumption as 
their index of emotionality no differ- 
ences were found among the experi- 
mental groups though the stimulate 
Ss consumed significantly more water 
than the undisturbed controls. No 
reason can be suggested to account for 
the failure to find a monotonic relationship
between stimulus intensity in infancy
and consummatory behavior.

Finally, this theory is not meant to 
be a substitute for the critical period 
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hypothesis. In fact, the functional re- 
lationships are most clearly seen when 
Ss are stimulated at different ages (but 
for the same number of days) and the 
data averaged so that age of stimulation
is equated. Ultimately, any general
theory of infantile stimulation will
have to account for both age of stimula- 
tion and stimulus input and will have 
to relate these to the psychology and 
biochemistry of ontogeny (Levine, 
1962).


Studies of schizophrenia have been plagued, perhaps more so than any
other area of psychological research, by the confounding effects of
impaired attention on laboratory and clinical behaviors. This paper
analyzes the perceptual response determinants of attentiveness and
inattentiveness in schizophrenia. A “cognitive control” theory of
attention is presented and elaborated through the use of research and
clinical data. An attention-response mechanism termed “sensory input
processing-ideational gating” is then formulated in an attempt to
explain some heretofore incomprehensible aspects of schizophrenic
disorders.


Psychiatrists have long regarded 
“impaired attention” as a predominant 
feature of schizophrenia (Kraepelin, 
1919). Recently a number of psycho- 
logical studies have again highlighted 
the significance of attention defects in 
schizophrenics. Weckowicz and Blew- 
ett (1959), in a study on size constancy 
and abstract thinking, concluded:


the abnormalities of thinking and perception 
in schizophrenic patients can be described as 
an inability “to attend selectively” or “to 
select relevant information” [p. 927]. 


Donahoe, Curtin, and Lipton (1961) 
after studying verbal learning stated: 


preliminary work strongly suggests that per- 
formance deteriorates inordinately if the 
schizophrenic is not allowed to program the 
stimuli himself [p. 557]. 


1 This paper was begun during the term
of a predoctoral fellowship at the University 
of Michigan and was supported, in part, by 
Public Health Service Fellowship MF 17981 
from the National Institute of Mental Health, 
Public Health Service. It was completed 
during the period of a Veterans Administra- 
tion Psychology Research Associateship at 
the Palo Alto Veterans Administration Hos- 
pital. 

The author wishes to express his apprecia- 
tion to Enoch Callaway, Norman Garmezy, 
Paul McReynolds, and especially to Mar- 
garet Singer for their most helpful advice 
and criticisms. 


Shakow (1962), reviewing a number 
of studies by him and his colleagues on 
set impairment in schizophrenia, theo- 
rized: 

It is as if, in the scanning process which 
takes place before the response to a stimulus
is made, the schizophrenic is unable to select 
out the material relevant for optimal response 
[p. 9]. 

While the above statements are far 
from startling, there is a paucity of 
researches which elaborate upon the 
above notions. Most of the psycho- 
logical studies of schizophrenics report 
again and again and again that schizo- 
phrenics are sicker and do more poorly 
than normals on experimental tasks
or conclude that some schizophrenics
are not as sick as other schizophrenics.
In the following pages, several hypotheses
about the process of attention
will be presented. ... hypotheses it is possible
to ... attention problems in schizophrenia.
Attention can be described in terms
of both extensive and selective factors
which serve to regulate the reception
and utilization of stimulus input (e.g.,
... 1960; Santos, Farrow, &
Haines, 1963; Solley & Murphy,
1960). The extensive aspect of attention
refers primarily to the degree to
which stimuli are sampled from the 
environment. It is inferred from labo- 
ratory tasks which emphasize percep- 
tual scanning (e.g., Gardner, Holzman, 
Klein, Linton, & Spence, 1959). Se- 
lectiveness of attention involves the 
articulation of a stimulus field into 
salient and irrelevant cues. It is in- 
ferred from tasks which necessitate 
attention to certain segments of a 
stimulus field and simultaneous inhibi- 
tion of attention to other segments of 
the field. This paper accounts for 
striking individual differences between 
schizophrenics on laboratory measures 
of perceptual scanning and field articu- 
lation and demonstrates systematic re- 
lationships between attention-response 
styles and schizophrenic symptom pat- 
terns and pre-illness histories. Ex- 
treme forms of scanning and field 
articulation, minimal or excessive, are 
shown to characterize the attention-response
dispositions of most schizophrenics.

The significance of these response
dispositions derives, on the one hand,
from their relationships to personality
structure and, on the other hand,
through their use in the analysis of
basic perceptual orientations to the
environment. A detailed consideration
... Berlyne, ... postulation of an “i...
mechanism” in sel... functions to filter
or to dis...ing connotative environm...


Subgroups within Schizophrenia


The designation of diagnostic criteria
which differentiate among the diverse
forms of schizophrenia is sorely needed.


Undesirable heterogeneity in current
diagnostic classifications is grossly
evident in many studies of schizophrenia
in which both interindividual and
intraindividual variability are discouragingly
great. In the course of this
review, an analysis of attention response
styles will be used to delineate
sources of correlation and also of confusion
among the most prominent categorizations
of schizophrenia. It will
be shown that, despite reports of unreliability
of diagnosis, there is much
to recommend the use of classical
Kraepelinian diagnosis of schizophrenic
subtypes in selecting subjects for research.
This will apply particularly to
the paranoid-nonparanoid differentiation
(e.g., Shakow, 1962). The utility
of the process-reactive distinction also
will be shown to be extremely important
(e.g., Garmezy & Rodnick, 1959).
Additionally, it will become clear that
Kraepelinian subgroupings and the
process-reactive dichotomy are intertwined
with chronicity factors and
physiological arousal patterns in complex
ways but that the lack of adequate
experimental controls in many studies
has served to obscure the nature of
such complex interactions.

Paranoid-Nonparanoid Distinction.
Numerous experimental and clinical
... Foulds & Owen, 1963;
Izard, 1962; Jackson, 1960; Kraepelin,
1919; Lynn, 1963; Raush, 1952; Sexton,
1945; Silverman, 1963a, 1964a,
1964b; Snyder, 1961; Voth, A. C.,
1947). In addition, several recent
papers have pointed to the theoretical 
and research confusion which occurs 
when paranoid schizophrenics are not 
differentiated from nonparanoid schizo- 
phrenics (McConaghy, 1960; Shakow, 
1962; Wynne & Singer, 1963b). 

The central symptom in the para- 
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noid schizophrenic which distinguishes 
him from other Kraepelinian subtypes 
is the prominence of a delusional sys- 
tem in his thinking. The delusional 
system may be highly elaborated and 
systematized, relatively unsystema- 
tized, or over time it may become less 
systematized. Indeed, degree of sys- 
tematization of delusions and the tem- 
poral or chronicity factor appear sig- 
nificantly correlated with differences 
in attention response patterns among 
paranoid schizophrenics. In contrast 
to the paranoid subgroup the predomi- 
nant symptoms of other schizophrenic 
subgroups include stupor, negativism, 
excessive motor activity (catatonia);
inappropriate affect, bizarre mannerisms,
regressive behaviors (hebephrenia);
emotional impoverishment, slow
deterioration of behavior, social with- 
drawal, and fantasy preoccupation 
(simple schizophrenia). 

Process-Reactive Distinction. A second
major categorization of schizophrenic
patients is based primarily
upon developmental and prognostic fac- 
tors (Becker, 1956; Fine & Zimet, 
1959; Garmezy & Rodnick, 1959; 
Kantor, Wallner, & Winder, 1953). 
Many laboratory experiments indicate 
such clarification reduces perform- 
ance variability between experimental 
groups and yields significant differences
between the two groups of
schizophrenics (e.g., Garmezy & Rod- 
nick, 1959; Herron, 1962; Rodnick & 
Garmezy, 1957). The process label is 
applied to patients who exhibit poorly 
integrated prepsychotic personalities 
with a continuous and prolonged in- 
sidious development of schizophrenic 
symptoms and who have a poor prog- 
nosis. Reactive schizophrenics show a 
relatively adequate premorbid adjust- 
ment, a rapid onset, a stormy clinical 
picture and have a relatively good 
prognosis. The terms poor-premorbid 
adjustment and good-premorbid adjustment
are often used interchangeably
with process and reactive. The
only difference between the two dichotomies
stems from an assumption
within the process-reactive construct 
that some kind of organic substrate 
underlies the pathology of the process 
group but not that of the reactive 
group. No such assumption is made 
in the poor-premorbid-good-premorbid 
differentiation. However, this does 
not appear to be relevant to the man- 
ner in which subjects are selected for 
research purposes nor are the various 
scales which define the two concepts 
sufficiently different to justify consid- 
ering them separately. 


COGNITIVE CONTROLS OF ATTENTION 


Recent studies of individual con- 
sistencies in cognitive behavior have 
emphasized the importance of enduring 
response dispositions which are relatively
invariant over broad ranges of
laboratory tasks and environmental
conditions (Gardner et al., 1959; 
Klein, 1958). These response dispo- 
sitions, termed “cognitive control prin- 
ciples” have been conceptualized as 
more or less automatic adjustments to 
the environment which emerge in the
course of development and which are
shaped by both constitutional and experiential
factors. Cognitive control
principles refer to various modes of
organizing percepts, ideas, memories, 
sensations, etc. These organizational 
tendencies vary from person to person 
and reflect typical strategies of coordi- 
nating private perceptual and concep- 
tual experiences with those of the “real 
world.”² An individual’s characteristic
pattern of utilizing these organizational
tendencies determines the outcome
of his coordinations of inner
experiences with things and events and
constitutes his “cognitive style.”

2 For example, some people have an underlying
preference for making fine discriminations
about nearly everything. This trait
may be an asset under certain conditions but
it also may prove to be a nuisance in other
situations, e.g., the scientist who is always
“splitting hairs” both at work and at home.

Cognitive controls of attention have 
been defined in terms of (a) subject 
differences in the extensiveness with 
which stimuli are sampled when attend- 
ing to a sensory or perceptual field— 
the “scanning control,” and (b) in
terms of subject differences in attention 
to certain segments of complex stimu- 
lus fields and simultaneous inhibition 
of attention to other segments of these 
fields—the “field-articulation control.” 
A factor analysis of scores on a number 
of tests of scanning and field articula- 
tion, from a large sample of subjects, 
yielded two orthogonal factors (Gard- 
ner, 1961). This finding of independ- 
ence between the two attention control 
principles was confirmed in a subse- 
quent study by Gardner and Long 
(1962a). These investigators also 
studied various perceptual conditions 
in which scanning and field-articulation 
response dispositions codetermined 
perceptual judgments. 


Scanning Control Principle 


Piaget (1950), in his discussions of
developmental factors related to
attention, suggests that early in life
perception is subject to much distortion.
The young child’s attention appears to
anchor upon dominant objects in the
stimulus field and automatically causes
him to overestimate stimuli in the
center of the field. These “centration
effects” are an inherent feature of the
functioning of the perceptual apparatus.
In the course of development, the
individual ordinarily learns to minimize
these misperceptions of apparent size
by shifting his attention to and from
the center of the perceptual field. In
numerous experiments with both children
and adults, Piaget and his colleagues
have demonstrated that objects in the
center of an individual's perceptual
field are overestimated. Thus, in
certain studies of perceptual constancy,
children tend to show underconstancy,
i.e., to overestimate a near object in
relation to a far object because they
“center” on the near object. Adults,
on the other hand, tend to overestimate
the distant object (overconstancy)
because they “center” more upon the
distant object. The child's judgments
tend to be made in terms of the sizes
of objects as they are recorded on the
retina. In contrast, the normal adult
appears to overcompensate for this type
of error tendency.

2 (continued) Another example: some
people are readily influenced by
pressures from their immediate
environments. Others rely primarily on
internal cues to guide them in making
decisions or in accepting or not
accepting popular opinions.

Errors in judgment of apparent 
magnitude have also been found to be 
a function of the length of time during 
which subjects “center” upon stimuli 
in the course of making perceptual 
judgments (Piaget, Vinh-Bang, & 
Matalon, 1958). The longer the dura- 
tion of a centration, i.e., the longer the 
stimulus is viewed, the greater is the 
tendency toward overestimation of the 
stimulus. Errors of overestimation of 
standard stimuli are regarded as a 
general tendency to measure less stable 
parts of the perceptual field (the varia- 
ble stimuli) in terms of the more stable, 
the standard stimuli. 

Gardner and his colleagues (Gardner,
1961; Gardner et al., 1959, 1960;
Gardner & Long, 1962a, 1962b) have
elaborated upon Piaget’s formulation of 
centration effects. They have shown 
that individuals who scan a visual field 
extensively—who repeatedly look back 
and forth from one segment of the field 
to other segments of the field—evidence 
minimal overestimation or underestimation
of a standard stimulus in certain
size estimation procedures. On
the other hand, limited scanning be- 
havior is associated with large over- 
estimations of the standard. Precise 
recordings of scanning behaviors have 
been made by filming eye movements 
during size estimation trials (e.g.,
Gardner & Long, 1962b). In addition, 
significant individual consistencies lead- 
ing to the formulation of the “scanning 
control principle” have been observed 
in tests of perceptual constancy, inci- 
dental noticing, and in a free association 
test which requires the subject to select 
associations from internal memory 
schemata (Gardner et al., 1959). Sub- 
jects characterized by extensive scan- 
ning responsiveness have also been 
found to show more Rorschach indices 
of “isolation” and “projection” as com- 
pared with limited scanners (Gardner 
& Long, 1962b). Finally, studies by 
Farrow and Santos (1962) and Solley 
and Santos (1958) indicate that indi- 
vidual differences on the autokinetic 
illusion and the Necker cube reversible 
figures procedure may be interpreted 
in terms of differences in scanning be- 
haviors. These latter studies demon- 
strate that the scanning response, like 
any other response, is directly affected 
by positive or negative reinforcement 
(also Santos, Farrow, & Haines,
1963). 

The scanning mechanism takes on 
special significance when applied to an 
analysis of perceptual studies of schizo- 
phrenics. Whereas normal groups 
typically show moderate scanning be- 
haviors on tasks from which scanning 
responsiveness can be inferred, schizo- 
phrenics evidence either marked scan- 
ning or minimal scanning (e.g., Harris, 
1957). These extreme response dis- 
positions in schizophrenia can be re- 
lated to the type of symptom pattern, 
pre-illness history factors, and the
stage of the psychotic disturbance. Note
that this type of analysis focuses on
patterns of constant error scores on the
various perceptual tests rather than
upon degree of accuracy or correctness.
The additional information which can be
obtained by this method makes possible
new explorations of attention response
patterns within and between the various
types of schizophrenias.
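
The distinction drawn above between constant error and accuracy can be
illustrated with a short sketch. The function names and the sample
judgments are hypothetical and are not taken from any study cited here;
the sketch only shows that a signed (constant) error preserves the
direction of misjudgment, whereas an unsigned accuracy score discards it.

```python
from statistics import mean


def constant_error(judgments, standard):
    """Mean signed deviation from the standard; positive = overestimation."""
    return mean(j - standard for j in judgments)


def absolute_error(judgments, standard):
    """Mean unsigned deviation from the standard (a plain accuracy score)."""
    return mean(abs(j - standard) for j in judgments)


# Hypothetical judgments of a 40-unit standard by two imaginary subjects.
standard = 40.0
overestimator = [44.0, 46.0, 45.0, 45.0]
underestimator = [36.0, 34.0, 35.0, 35.0]

ce_over = constant_error(overestimator, standard)
ce_under = constant_error(underestimator, standard)
ae_over = absolute_error(overestimator, standard)
ae_under = absolute_error(underestimator, standard)
```

In this made-up case the two subjects are equally inaccurate, yet their
constant errors have opposite signs, which is the information the
scanning interpretation relies upon.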

Perceptual Constancy. Objects of
the same size or shape, seen at different
distances and at different angles,
ordinarily tend to be experienced as
similar (perceptual constancy). This
phenomenon occurs despite the fact
that retinal images produced by
objects viewed at different distances
may be quite different. In normal
everyday experience and under
laboratory conditions, there is a
tendency for overcompensation in the form
of overconstancy to occur. Only when
spatial and optical cues are markedly
reduced, or when instructions geared to
breaking down constancy perceptions
are used, does the overconstancy
tendency diminish in normal, not
excessively aroused, and nondrugged
subjects (Ardis & Fraser, 1957; Callaway
& Dembo, 1958; Carlson, 1962; Holway
& Boring, 1941).

In accord with the findings of Gardner
et al. (1959), the scanning control
mechanism will be used to interpret a
number of studies of individual
differences in perceptual constancy. As
specifically applied to these studies,
the relationship between scanning and
perceptual constancy is hypothesized
as follows: The more intensively a
subject scans the perceptual field, the
greater is the tendency to
overconstancy in his perceptual judgments.
On the other hand, minimal scanning
of the environment is correlated with
reduced overconstancy or
underconstancy. Thouless (1932) studied the
perceptual constancy performances of
introverted or “schizothymic”
individuals and extraverted or
“cyclothymic” individuals. He found that the
perceptual performances of schizothymic
patients, selected on the basis of
Kretschmer’s descriptions, tended to be
less overconstant than the performances
of cyclothymic individuals. This
finding of greater size constancy in
extraverts than in introverts was confirmed
by Weber (1939). Ardis and Fraser
(1957) studied shape constancy among
subjects who were selected on the basis
of scores on the Minnesota TSE
Inventory of introversion-extraversion.
A highly significant relationship was
found between introversion as a
personality trait and reduced shape
constancy.

These perceptual differences be- 
tween introverts and extraverts fit well 
with H. M. Voth’s (1962) conceptuali- 
zation of the degree to which such 
individuals characteristically seek out 
external stimulation. Voth suggests 
that the actions and feelings of extra- 
verted-type individuals are determined 
primarily by external stimulus events 
(“they live by radar”). The more
self-absorbed introverted-type
individuals evidence a relative
independence from external conditions.
The latter group scans the environment
to a lesser degree than the former.

Raush (1952) hypothesized that the 
markedly self-absorbed or socially 
withdrawn nonparanoid schizophrenic 
would evidence perceptual undercon- 
stancy, whereas the cautious, suspi- 
cious, and perceptually vigilant para- 
noid schizophrenic was expected to 
show extreme overconstancy. Under 
ordinary visual conditions, nonschizo- 
phrenics, paranoid schizophrenics, and 
nonparanoid schizophrenics showed 
overconstancy. The paranoid group,
however, was significantly more
overconstant than the nonschizophrenic
group. When size judgments were
made in a darkened room, paranoids
evidenced even greater overconstancy
judgments than those which were made
in the lighted room. In this condition
overconstancy accentuation was signifi- 
cantly different from both the non- 
paranoid group and the control group; 
the latter two groups tended to reduce 
overconstancy. Similar findings were 
obtained by Hartman (1962) in 
studies of the afterimage observa- 
tions of paranoid schizophrenics with 
well-systematized delusions, nonpara- 
noid schizophrenics, and nonpsychiatric 
subjects. (The phenomenological size 
of afterimages may be considered as 
the converse of size constancy since dis- 
tance cues affect changes in size per- 
ception of an afterimage in a reverse 
manner. The performances of
Hartman’s experimental groups showed the
same kinds of constancy dispositions
as Raush’s groups and, as predicted,
in reverse order.)

Several studies have reported under- 
constancy in the size judgments of 
schizophrenics. Weckowicz (1957)
found chronic schizophrenics with 
hebephrenic and simple schizophrenic 
syndromes manifested underconstancy 
when compared with a group of non- 
schizophrenic mental patients and a 
group of hospital staff personnel. In 
a later report (Weckowicz & Blewett, 
1959), it was noted that hebephrenics 
with much thought disorder tended 
to respond in an underconstant manner 
whereas paranoids with little thought 
disorder showed overconstancy (p. 
912). Weckowicz, Sommer, and Hall 
(1958) found that chronic schizophre- 
nics, 17 out of 20 being diagnosed as 
nonparanoid schizophrenics of various 
types, showed poor distance constancy. 
(No performance records were pre- 
sented). Significant correlations were 
also found between poor distance con- 
stancy and underconstancy in size 
judgments. Crookes (1957) reported 
significantly lower perceptual constancy
performances in a group of
schizophrenics, unspecified as to Krae- 
pelinian subgroup, than in groups of 
neurotics, psychopaths, and normals. 

Lovinger (1956) divided his schizo- 
phrenic subjects into “good contact” 
and “poor contact” groups on the basis 
of staff personnel ratings. Size con- 
stancy perception was studied under 
three conditions: maximized distance 
cues (binocular viewing of the stim- 
uli), minimized distance cues (mo- 
nocular viewing of the stimuli), and 
no distance cues (monocular viewing, 
reduction screen, poor lighting). All 
groups were very similar under condi- 
tions of maximum or absent cues. But 
when minimal cues were offered, poor 
contact schizophrenics achieved less 
constancy (overestimation of the closer 
object) than the good contact and con- 
trol groups, which did not differ sig- 
nificantly from each other. No signifi- 
cant differences were found between 
the constancy performances of paranoid 
and nonparanoid schizophrenics. In 
attempting to account for the discrep- 
ancy between this latter finding and 
Raush’s data, Pearl (1962) stressed 
the importance of the acute versus the 
chronic dimension of schizophrenia. 
He pointed out that Lovinger’s “poor 
contact,” underconstant schizophrenics 
were primarily chronic and that his 
“good contact,” overconstant subjects 
were mostly early schizophrenics. 
Similarly, he suggested that Raush’s 
paranoid group were primarily acutes 
and his nonparanoid group were mainly 
chronics. Whereas such an inference 
may be warranted in the Lovinger 
study, there is little basis for such a 
statement about Raush’s subjects. A 
more satisfactory explanation of
Lovinger’s data may rest on the separation
of the chronicity variable from Krae- 
pelinian diagnosis. 

Autokinesis. Further indications of 
differences in scanning responsiveness
between paranoid and nonparanoid
schizophrenics (and between extra- 
verts and introverts) are found in 
studies of the autokinetic effect (Hahn, 
1956; Sexton, 1945; A. C. Voth, 1941, 
1947; H. M. Voth, 1962). The auto- 
kinetic effect is a visual illusion which 
usually can be observed by looking at a 
stationary point of light, from a short 
distance, and for a period of time, in 
an otherwise dark room. Marked dif- 
ferences in amount of perceived move- 
ment in this situation have been found 
between introverts and extraverts and 
between different psychiatric groups. 
Introverted subjects perceive more 
movement than extraverted subjects 
and, within the schizophrenia classifi- 
cation, nonparanoid subgroups report 
much greater movement than paranoid 
schizophrenics. These findings may be 
viewed as an extension of the
“centration effects” formulation. Piaget and
his colleagues had demonstrated that 
differences in the perceived magnitude 
of a stimulus are a function of the 
number of repeated “centerings” upon 
the stimulus and also of the length of 
time during which subjects “center” 
upon the stimulus. It appears that in 
the autokinetic situation subjects who 
scan extensively, i.e., who repeatedly 
“center” on and away from the light 
point, evidence relatively small amounts 
of perceived movement. On the other 
hand, minimal scanning, i.e., minimal
“centering” and excessive fixation upon 
the light point, is correlated with sig- 
nificantly greater amounts of perceived 
movement. 

A. C. Voth (1947) reported a study 
in which groups of catatonic,
hebephrenic, and simple schizophrenic
patients showed much perceived movement
of the light in the autokinetic
situation; individuals diagnosed
paranoid schizophrenic, true paranoia,
paranoid condition, and involutional
paranoia reported little movement. It
appeared that the more prominent
paranoia, without bizarre ideation, was
in the symptom pattern, the greater
was the tendency to report no
autokinetic movement. It was found that
either extreme movement or no move- 
ment at all were poor prognostic signs. 

Sexton (1945), in his autokinetic 
study of schizophrenics, found that 36 
out of 38 nonparanoids (18 catatonic 
and 20 hebephrenic) evidenced a 
“large” or “maximum” amount of 
movement whereas 7 out of 8 para- 
noid schizophrenics evidenced lesser 
amounts of movement. He also re- 
ported that the greater the degree of 
perceived movement, the greater was 
the extent of clinically observable with- 
drawal behavior. Hahn’s (1956) find- 
ings were consistent with this and also 
indicated large amounts of perceived 
movement among nonparanoid
schizophrenics. In Hahn's experiment,
chronic nonparanoids were assigned 
to one of three reinforcement condi- 
tions—reward, punishment, and ordi- 
nary testing conditions. They were 
tested individually in the autokinetic 
situation before and after the adminis- 
tration of success, failure, and control 
treatments. Success experiences were 
found to decrease perceived movement 
and failure experiences were found to 
increase perceived movement. Such 
data lend support to the hypothesis 
that changes in the degree and direc- 
tion of scanning of the environment 
are correlated with shifts in attention 
which have approach and avoidance 
utility (Santos, Farrow, & Haines, 
1963).* These results further suggest
that nonparanoid schizophrenics
evidence more than minimal scanning
only when such behavior is followed
by self-satisfaction and a reduction of
anxiety. They will evidence minimal
scanning, resulting in constriction of
the field, when attending to
the field is followed by punishment and
an increase in anxiety. The
generalizing of this aspect of the attention
response, i.e., the formation of a minimal
or excessive scanning response
disposition, is one way of conceptualizing
the approach-avoidance behaviors of all
organisms (Mowrer, 1960). It is also
in line with Solley and Murphy's
(1960) and Guthrie’s (1959)
contentions that the attention response, like
any other response, may be conditioned.

* Alterations in the degree and
direction of scanning responsiveness have
also been achieved among normal subjects
by differentially reinforcing perceived
movement in different regions of an
autokinetic field (Farrow & Santos,
1962).

One last point needs to be made 
here. It is extremely important to dis- 
tinguish scanning behaviors which are 
shaped in and are specific to a given 
experimental situation from character- 
ological scanning dispositions. It is 
not difficult to alter the attention be- 
haviors of schizophrenics who are 
either maximum or minimum scanners 
by controlling specific reinforcement 
contingencies in the experimental situ- 
ation (Silverman, 1963a). However, 
the more enduring aspects of a scan- 
ning response disposition are appar- 
ently based upon a long and stable 
history of reinforcement contingencies 
and are highly resistant to change. 
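
The size of the group difference in Sexton's (1945) counts, cited
earlier in this section, can be gauged with an exact test on a 2 x 2
table. The sketch below is an illustration only: the source does not
say that such a test was reported, the function name is hypothetical,
and the table assumes that the 2 remaining nonparanoids fell in the
"lesser movement" category and the 1 remaining paranoid in the "large
movement" category.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p value for the table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables with the same
    margins that are no more probable than the observed table.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    total = row1 + row2
    denom = comb(total, col1)

    def prob(x):
        # Probability that the top-left cell equals x, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    low = max(0, col1 - row2)
    high = min(row1, col1)
    p_observed = prob(a)
    return sum(
        prob(x) for x in range(low, high + 1)
        if prob(x) <= p_observed * (1 + 1e-9)
    )


# Rows: nonparanoid (38), paranoid (8).
# Columns: "large/maximum" movement, lesser movement (assumed split).
p_value = fisher_exact_two_sided(36, 2, 1, 7)
```

Under these assumptions the p value is very small, which agrees with
the marked group separation described in the text.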
Size Estimation. Filmed eye-move- 
ment recordings of scanning behaviors 
have indicated that individual differ- 
ences in scanning behavior can be re- 
liably inferred from a particular form 
of size estimation procedure (Gardner 
& Long, 1962b). This size estimation 
procedure was employed in a recent 
study of schizophrenic patients in 
which marked scanning differences 
were hypothesized between paranoid 
and nonparanoid groups (Silverman, 
1964a). In accord with the Gardner 
et al. (1959) formulation, the
presumed minimal scanning nonparanoids
were expected to evidence large
overestimations of the standard stimuli,
whereas extensive scanning paranoids
were expected to underestimate
stimuli. The patients, whose diagnoses
were unknown by the experimenter, 
were instructed to adjust a circular 
patch of light until it looked exactly 
like the size of a hand-held disk. Three 
different sized disks were presented 
four times each, in random order, 
for a total series of 12 judgments. 
Analyses of the mean size estimation 
differences between diagnostic groups 
yielded significant p values ranging 
from .0025 for the three separate stim- 
uli to .0006 for the total series of 
judgments. These marked differences 
were consistent with Piaget’s centra- 
tion hypothesis and the Gardner et al. 
scanning hypothesis. 
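
The structure of this procedure (three disk sizes, four presentations
each, random order, 12 judgments in all) can be sketched as follows.
The disk sizes, units, function names, and the simulated subject are
hypothetical, since the source gives none of them; only the 3 x 4
randomized design comes from the description above.

```python
import random
from statistics import mean


def make_schedule(disk_sizes, repetitions=4, seed=None):
    """Return a randomly ordered list of standard sizes, one per trial."""
    rng = random.Random(seed)
    trials = [size for size in disk_sizes for _ in range(repetitions)]
    rng.shuffle(trials)
    return trials


def mean_signed_error(trials, settings):
    """Mean (setting - standard) per disk size and over the whole series."""
    per_disk = {}
    for size, setting in zip(trials, settings):
        per_disk.setdefault(size, []).append(setting - size)
    by_disk = {size: mean(errors) for size, errors in per_disk.items()}
    overall = mean(setting - size for size, setting in zip(trials, settings))
    return by_disk, overall


disk_sizes = [30.0, 40.0, 50.0]           # hypothetical sizes
schedule = make_schedule(disk_sizes, seed=0)
# Hypothetical subject who sets the light patch 2 units too large each time.
settings = [size + 2.0 for size in schedule]
by_disk, overall = mean_signed_error(schedule, settings)
```

Positive values correspond to overestimation of the standard and
negative values to underestimation, for the separate stimuli and for
the total series.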

The application of this type of an 
analysis to some size estimation studies 
of poor-premorbid and good-premor- 
bid schizophrenics indicates significant 
scanning differences between the two 
groups. Further, these studies high- 
light the effect of anxiety level, physio- 
logical arousal, and degree of chro- 
nicity. (It will be recalled that the 
poor-premorbid classification describes 
individuals who show gradual or in- 
sidious prodromal signs of schizophre- 
nia beginning relatively early in life 
as contrasted with the developmentally 
more advanced good-premorbid schizo- 
phrenics with a good prognosis who 
show a relative absence of early pro- 
dromal signs and a rapid, stormy 
onset.)

Harris (1957) studied the size
estimation performances of poor-premorbid
and good-premorbid schizophrenics
(divided on the basis of the
Phillips Prognostic Scale) and a non- 
psychiatric group. The subjects were 
asked to judge from memory the sizes 
of previously presented pictures which 
had been selected for their presumed
cue relevance for the patients. These
pictures included mother-son scenes of 
overprotection, rejection (ignoring), 
domination, and feeding (acceptance), 
together with a more neutral tree-bush 
scene and a geometric figure (square). 
Significant differences were found be- 
tween the two patient groups which 
were not at all apparent when the total 
schizophrenic group was compared 
with the nonpsychiatric group. The 
poor-premorbid group tended to judge 
the stimuli to be larger than they
actually were (minimal scanning); the
good-premorbid group tended to
underestimate them (extensive scanning);
and the nonschizophrenic group 
showed minimal errors in size esti- 
mation (“balanced” scanning). Fur- 
thermore, the overestimation or under- 
estimation tendencies were also appar- 
ent when the frame containing the 
neutral picture was judged (Fig. 1).

In a subsequent study by Zahn 
(1959), positive and negative affective 
value were first conditioned to neutral 
pictures during a discrimination task 
not involving differential size
judgments. Subjects’ responses to one of
the pictures were rewarded by the
comment “RIGHT” and their responses to
the other picture were punished by the
comment “WRONG.” A pilot study, in
which subsequent judgments of the
sizes of the stimuli were made from
memory, was consistent with that of
Harris. In addition to a general
overestimation-minimal scanning tendency
on the part of the poor-premorbid
group, Zahn found that the
poor-premorbid group overestimated the size of
the picture associated with punishment
relative to their size estimations of the
rewarded picture. Good-premorbid
subjects showed tendencies in the
opposite direction. The interaction
between premorbid adjustment and
reactiveness to type of reinforcement was
statistically significant.

FIG. 1. A comparison of mean size estimates of good and poor premorbid
schizophrenic and normal groups for all scenes. (From Harris, 1957.)


Both of these size estimation studies 
indicate the presence of a correlation 
between extreme scanning response 
dispositions and the good premorbid- 
poor premorbid or process-reactive 
classification. These experiments also 
support the contention that premorbid 
history factors are related to genetic 
levels of perceptual development 
(Becker, 1956; Fine & Zimet, 1959). 
Thus, the developmentally earlier ap- 
pearing minimal scanning response 
(Piaget, 1950) was characteristic of 
the poor-premorbid group, especially 
in their estimations of the pictures asso- 
ciated with punishment. The good- 
premorbid schizophrenics tended to 
react to these same pictures by
extensively scanning them and thus reducing
their apparent magnitude. Note here
that the scanning-response construct
also provides a heretofore elusive ex- 
planation for the translation of attitudes 
toward pleasantness and unpleasantness 
into changes in estimated size. For 
example, individuals who ordinarily 
evidence minimal scanning, react to 
unpleasant stimuli by further reducing 
their scanning responsiveness and thus 
enhancing the perceived magnitude 
of stimuli. Underestimating-extensive
scanners appear to scan to an even 
greater degree when confronted with 
unpleasant stimuli. 

That the relationship between pre- 
morbid adjustment and size estimation 
performance is far more complex than 
was first thought is illustrated in a 
later study by Zahn. In contrast to 
the results of the earlier studies, the
good-premorbid group of the later
study showed the overestimation trend. 
That is, compared to their size estima- 
tions of the rewarded stimulus and 
the other stimuli, they significantly 
overestimated the size of the punished 
stimulus and of a picture connoting 
domination (mother scolding son). 
On the other hand, the poor-premorbid 
group showed a generalized under- 
estimation tendency on all stimuli. 
The nonschizophrenics significantly 
overestimated the size of the punished 
picture relative to the reward picture 
on the early experimental trials, but 
not over the total number of trials. 
Further, the good-premorbid group was
the only group which judged the stim- 
uli differently. This almost complete 
reversal in the error patterns for the 
schizophrenic groups, as compared to 
the results of earlier size estimation 
studies, was discussed by Zahn in terms 
of two important considerations:


1. Differences in anxiety between 
the schizophrenic groups may have af- 
fected size judgment performances. 
What qualitatively had appeared as 
higher anxiety in the good-premorbid 
group could have somehow operated to 
produce overestimation of the picture 
associated with failure (punishment). 

2. Significant differences in the 
length of hospitalization existed be- 
tween subjects who took part in Zahn’s 
latter study and subjects who partici- 
pated in Harris’ study and Zahn’s pilot 
study. Whereas mean lengths of time 
since first psychiatric hospitalization 
for the “goods” and “poors” in Zahn’s 
latter study were 6.00 years and 8.09 
years, respectively, in both of the 
earlier studies these means were 3 
years and less. 


A careful search of the literature 
indicates that, indeed, both level of 
anxiety or emotionality and degree of 
chronicity are inextricably interwoven
with the process-reactive hypothesis
and that they are in fact crucial to an 
explanation of the above results. 8 

Using the same size estimation
procedure with college students as was
used with schizophrenics, Zahn (1960) 
found that subjects with high Taylor 
Manifest Anxiety (HA) scores showed 
significant relative overestimations of 
the punished picture, whereas low- 
anxious subjects did not. 


It is interesting to note, however, in view of
the similarities of method in the two
experiments, that . . . patients show a much
greater difference between the success and
failure pictures than do the present
normals. . . . This greater difference for
the patients would seem to indicate a greater 
degree of anxiety or perceptual lability or 
both [pp. 460-461]. 


It was also noted that “psychological 
anxiety” items of the Taylor scale were 
significantly related to the reward- 
punishment difference in size estima- 
tion but “somatic anxiety” items were 
not. 

Ilanit (1961), using the Elgin Prog- 
nostic Scale to divide patients into 
process and reactive groups, took spe- 
cial care to restrict his population to 
acute schizophrenics. Patients selected 
had no history of previous psychiatric 
hospitalizations and had been in the
hospital 18 months or less. Two
measures of anxiety, the Taylor Manifest
Anxiety (MA) scale and the Welsh
Anxiety Index, were administered to
all subjects along with various other
psychological tests. Whereas no
differences were found between the
process and reactive groups on the MA
scale, the Welsh Anxiety Index
revealed significant differences in the
direction opposite from that which
would ordinarily be expected and from
that predicted by Ilanit. The early
process group had significantly higher
anxiety scores than did the early
reactive group. The failure of the MA
scale to differentiate between groups
may very well prove related to a dis- 
tinction between “psychological anx- 
iety” and “somatic anxiety.” This 
conjecture is presented in view of the 
significant findings on the Welsh anx- 
iety scale especially since the items 
on the Welsh scale refer primarily to 
personal feelings which an individual 
has about himself, i.e., “psychological 
anxiety.” 

Zuckerman and Grosz (1959) stud- 
ied autonomic nervous system reactiv- 
ity in process and reactive schizophre- 
nics by means of the mecholyl test, a 
cholinergic drug that normally pro- 
duces marked changes in blood pres- 
sure. The subjects were patients at 
a treatment center for acute psychiatric 
disorders. Of the 27 schizophrenics 
tested, only 4 of them were “post- 
treatment” cases. Thus the patient 
population was primarily an early 
schizophrenic group. The subjects, 
divided into process and reactive 
groups on the basis of the Kantor, 
Wallner, and Winder (1953) criteria, 
were administered mecholyl on four 
separate occasions since previous re- 
search (Grosz & Miller, 1958) had 
indicated considerable day to day vari- 
ability in response to this drug. The 
early process group showed a
significantly greater drop in systolic blood
pressure following injection of
mecholyl as compared with the early reactive
group. 

This finding is in direct contrast to 
reports by King (1958), Funkenstein, 
Greenblatt, and Solomon (1950), 
Meadow and Funkenstein (1952) and 
others. In these latter studies all 
schizophrenic subjects had been 
hospitalized on at least one occasion. 
Furthermore, the results of similar in- 
vestigations with long-term chronic 
schizophrenics (e.g., Meadow, Green- 
blatt, Funkenstein, & Solomon, 1953; 
Bockoven, Greenblatt, & Solomon,
1953) would suggest that most of the
patients in these “contradictory” studies 
had long amounts of total hospitaliza- 
tion. Data on number of previous 
hospitalizations, time elapsed since first 
hospitalization, and total length of time 
spent in mental hospitals were not re- 
ported in any of the “contradictory” 
research reports. In any case the re- 
sults of mecholyl test studies of chronic 
schizophrenics indicate that strong 
drops in blood pressure are not asso- 
ciated with the process category as 
Zuckerman and Grosz found in their 
study of acute patients. Rather, among
chronics, reactive schizophrenics with 
initial good prognoses show the 
stronger drops in blood pressure. 
Process chronic schizophrenics evi- 
dence minimal drops in blood pressure 
following mecholyl infusion. Such re- 
sponsiveness appears to occur after 
they have been schizophrenic for some 
time. Supporting this contention is a 
study by Weckowicz (1958) of the 
relationship between response to 
mecholyl and perceptual constancy 
among twenty long-term chronic 
schizophrenics (8 years mean hospitali- 
zation). Subjects tending to show 
normal overconstancy had a blood pres- 
sure response pattern most similar to 
one associated with poor prognosis 
(process schizophrenia) and least simi- 
lar to one associated with good prog- 
nosis (reactive schizophrenia). That 
is, blood pressure changes for the over- 
constant perceivers corresponded to the 
Group I-poor prognosis pattern found 
by Funkenstein and his colleagues, i.e.,
an initial drop followed by a larger rise 
in pressure. Underconstant subjects 
evidenced lesser or no subsequent rises 
in blood pressure.* Such data are
consistent with Zahn’s findings with
chronic poor-premorbid and
good-premorbid patients wherein the
poor-premorbids and not the good-premorbids
evidenced perceptual performances most
similar to those of nonschizophrenics.

Taken together these data indicate
that much of the apparent contradiction
in the literature may be due to the
confounding effects of varying
chronicity within groups of patients who
otherwise are selected in a reasonably
careful manner. On the basis of
studies which have controlled for
degree of chronicity, either inadvertently
or otherwise, it is suggested that early
process schizophrenics manifest
emotional, autonomic, and attention
response characteristics almost
completely opposite to those of chronic
process patients. Opposite characteristics
also have been found between early
and long-term chronic-reactive
schizophrenics (Table 1).

* In addition, Weckowicz found that
weaker drops in blood pressure tended to be
associated with overconstancy. That this
correlation did not reach statistical
significance was probably due to a more
constricted range of values which are
associated with drops in blood pressure as
compared to rises. Thus, Funkenstein et al.
(1950) reported that, for schizophrenics,
blood pressure response patterns following
mecholyl tend to cluster to a greater extent
in two of seven groups, both of which are
characterized by minimal drops in pressure.
In Weckowicz’s sample of long-term chronic
patients, unselected as to process or
reactive type, it is quite probable that
subjects clustered primarily into the two
minimal drop-in-pressure groups.


TABLE 1

INFERRED RELATIONSHIPS BETWEEN PERCEPTUAL, EMOTIONAL, AND AUTONOMIC
VARIABLES IN PROCESS AND REACTIVE SCHIZOPHRENICS

            Mean length of all hospitalizations
Group       0-3 years                          6 years and over (approximately)

Process     Size overestimation-minimal        Overconstancy and size
            scanning                           underestimation-extensive scanning
            Higher anxiety                     Lower anxiety (inference)
            Stronger drops in systolic blood   Weaker drops in systolic blood
            pressure following mecholyl        pressure following mecholyl

Reactive    Size underestimation-extensive     Underconstancy and size
            scanning                           overestimation-minimal scanning
            Lower anxiety                      Higher anxiety (inference)
            Weaker drops in systolic blood     Stronger drops in systolic blood
            pressure following mecholyl        pressure following mecholyl


Field-Articulation Principle 


The previous section of this paper
reviewed and discussed studies on the
breadth of perceptual scanning under
relatively free conditions. The present
section will be concerned with the
selective aspects of attention, especially
those response mechanisms which determine
what elements in a stimulus
field exert dominant influence on
behavior. Studies by Piaget (19[...])
and his colleagues, Witkin and his
colleagues (Witkin, Dyk, Faterson,
Goodenough, & Karp, 1962; Witkin,
Lewis, Hertzman, Machover, Meissner,
& Wapner, 1954), and Wapner and
Werner (1957) have demonstrated that
perceptual differentiation of complex
stimulus fields evolves in the course of
development. The experiences of the
body, self, and the external environment
which early in life are essentially
global, vague, and diffuse, become progressively
more articulated. However,
even among “normal” adults wide vari- 
ations have been found in the degree 
of clarity with which the self and ex- 
ternal objects are segregated from the 
total perceptual field. These variations 
have been observed in laboratory pro- 
cedures requiring: (a) the perceptual 
separation of objects from their sur- 
rounds such as in the rod-and-frame 
test, (b) the differentiation of field 
cues from sensory cues such as in the 
tilting-room-tilting-chair task, (c) the 
perceptual restructuring of a stimulus 
field such as in the embedded figures 
task, and (d) the perceptual inhibition 
of distracting, intrusive cues such as 
in the color-word test or on the Müller-Lyer
illusion.

The concept of field articulation was 
elaborated by Gardner et al. (Gardner, 
1961; Gardner, Holzman, Klein, Lin- 
ton, & Spence, 1959; Gardner, Jackson, 
& Messick, 1960) to explain individual 
differences on tasks of “perceived in- 
congruity” in terms of differences in 
the selectiveness of attention. It was 
pointed out that on all of the above
mentioned procedures, effective per- 
formance was dependent upon directing 
attention to one of two sets of cues that 
induced opposing response tendencies. 
The field-articulation principle thus en- 
compassed Witkin’s concept of field 
dependence-independence (Witkin et 
al., 1954), Klein’s constricted-flexible 
control principle (Klein, 1954) and 
aspects of Piaget’s “strategies” of at- 
tention formulation (Piaget, 1950). 
Factor analyses of subjects’ perform- 
ance scores indicated significant inter- 
correlations among the tasks which 
defined these various concepts. Con- 
stant error performance patterns were 
consistent and reliable.

In a recent study by Weiss and Sherman
(1961), the Stroop color-word
test scores of acute and chronic paranoid
and nonparanoid schizophrenics
were correlated with their scores on 
the Taylor MA scale. The color-word 
test requires that subjects not attend to 
the printed names of colors but that 
they restrict their attention to rapidly 
naming the colors in which the words 
are written. The initial hypothesis 
tested was that high anxiety during a 
task which evokes competing response 
tendencies would result in greater re- 
sponse interference and thus poorer 
performance. Low anxiety scores on 
the MA scale were expected to be as- 
sociated with more efficient performance.
This hypothesis was supported for chronic schizophrenics
(median total hospitalization, 8 years)
but not for acute schizophrenics (med- 
ian total hospitalization, 9 months). 
Further analysis indicated that the 
more anxious subjects in the acute 
paranoid schizophrenic category ac- 
tually tended toward decreased re- 
sponse interference on the color-word 
test. The difference between the MA 
scale-Stroop correlations for acute 
paranoids and chronic paranoids was 
highly significant. No differences were 
found between the acute and chronic 
patients without paranoid features. 
These results suggest that the para- 
noid’s readiness to deal analytically 
with a complex stimulus field may 
have a characterological aspect which 
is readily observable in the acute stages 
of the disorder. Furthermore, this 
disposition is most apparent under con- 
ditions of moderate or high anxiety. 
The finding that chronic paranoids did 
not employ this type of responsiveness 
when extraordinarily anxious could in- 
dicate that the field-articulation re- 
sponse disposition is significantly af- 
fected by chronicity per se. Additional 
evidence in support of this latter hypothesis
will be presented at a later point.

Witkin et al. (1954) reported data
on the tendency of hospitalized mental 
patients to appear at the extremes of 
the distribution, when compared to 
nonpsychiatric patients, in their per- 
formances on the rod-and-frame test 
and the tilting-room-tilting-chair test. 
The psychiatric patients showed a strik- 
ing concentration of cases in either the 
extreme field-independent or the ex- 
treme field-dependent groups. Only a 
small proportion of hospital cases were 
found in the intermediate performance 
group. This was in contrast to the 
nonpsychiatric subjects who tended to 
fall mainly into the intermediate per- 
formance category and rarely in either 
of the extreme categories. Out of 12 
males diagnosed as paranoid schizo- 
phrenic, 7 were found to be markedly 
field-independent (high field-articulat- 
ors), 2 evidenced intermediate category 
performances, and 3 were markedly 
field-dependent (low field-articulators). 
No data on male nonparanoid schizo- 
phrenics were reported. The fact that 
female schizophrenics, regardless of 
diagnostic subtype, tended to be mark- 
edly field-dependent was consistent 
with the finding that women, as a 
group, were considerably more field- 
dependent than men. 

Taylor (1956) studied the embedded 
figures test (EFT) performances of 
schizophrenics who were rated as being 
primarily delusional or primarily hal- 
lucinatory. In this task, the subject is 
asked to find a particular simple figure 
within a larger complex figure. The 
simple figure is hidden by being part of 
the pattern of the more complex fig- 
ure (e.g., the outline of the simple 
figure may partly define the boundaries 
of several more prominent patterns in 
the complex figure). In the Witkin 
studies, subjects who performed in a 
field-dependent manner on the rod-and- 
frame and tilting-room-tilting-chair
tasks took a significantly longer time
than field-independent subjects to isolate
the embedded figures from the
surrounds. In Taylor’s experiment,
delusional and hallucinatory subjects
were selected on the basis of ratings on
a series of items derived from the
Lorr scale for rating psychiatric patients.
Schizophrenics who exhibited
both delusional and hallucinatory
symptoms were excluded from the
study. Results were consistent with
the hypothesis that delusional subjects
are more field-independent than hallucinatory
subjects as measured by
their EFT performances.

Witkin et al. (1962) recently reviewed
a number of studies of field
dependence-field independence among
various clinical groups. On the basis
of the evidence considered, the authors
concluded that,

when they become psychologically ill, persons
with contrasting modes of field approach are
apt to develop different kinds of pathology
[p. 209].


Some theoretical speculations were also
offered about the etiology of the field-
independent approach of patients in
whom paranoid (delusional) reactions
form a predominant symptom. In accord
with the formulations of Freud
(1958), Cameron (1947), and others,
Witkin et al. (1962) suggested that
the paranoid structures his environment
by projecting his own system of
ideas upon it. He does so in a highly
selective fashion, and is especially
prone to implicate particular people
and specific situations.


Such selectivity requires that experience of
the world be articulated. In this connection
[...]
differentiation [p. 209].


From this it would follow that the
more perceptually and conceptually articulated
the manner in which the paranoid
experiences his world, the more
systematized will be his delusional 
complex of ideas. 

Another implication to be derived 
from the field-articulation construct is 
suggested by some perceptual studies 
of children (Piaget, 1950; Witkin et
al., 1962). These studies indicate that 
the disposition to either a highly dif- 
ferentiated or poorly differentiated 
field-articulation response is present 
rather early in the course of develop- 
ment. In fact, studies by Escalona 
and Heider (1959) and others would 
suggest that these dispositions are pres- 
ent in the earliest days of life. It is 
conceivable then that such dispositions 
may be significant determinants of the 
type of schizophrenia that an individ- 
ual develops. One would therefore 
expect differences in the pre-illness
adjustments of schizophrenics to be 
correlated with degree of field articula- 
tion. Thus, Bryant (1961) divided 
chronic schizophrenics into poor-pre- 
morbid (process) and good-premorbid 
(reactive) groups and studied their 
performances on the embedded figures 
test and on the rod-and-frame test. 
Subjects for each group were selected 
from those with extreme scores on the 
Phillips Premorbid Adjustment Scale. 
It was hypothesized that the reactive 
group would perform on these tests in 
an analytic, field-independent manner 
and that the process group would evi- 
dence field-dependent performance. 
Significant differences in group per- 
formances were found in the predicted 
directions. Mean deviations of the 
reactive group on the rod-and-frame 
test approximated the mean obtained 
by a college group in the Witkin et al. 
(1954) study. On the other hand, the 
mean deviation of the process group 
approximated the mean deviation which 
Witkin et al. found for 8-year-old 
children. 

These findings become even more 
meaningful when considered in relation
to personality factors which Witkin et 
al. found to be associated with field- 
dependent and field-independent per- 
formances. Thus field-dependent indi- 
viduals typically evidenced such 
attributes as: an inability to function 
independently of environmental sup- 
ports, an absence of initiating activity, 
low self-esteem, difficulty in controlling 
anxiety, and a relatively primitive, un- 
differentiated body image. The de- 
scription of the process (field-depend- 
ent) patients in these terms is 
consistent with the data on their in- 
adequate social-sexual adjustments 
(Garmezy & Rodnick, 1959). By the 
same token, the personality correlates 
of field independence are appropriate 
descriptions of the reactive patients 
who evidence more adequate social- 
sexual premorbid adjustments. Field- 
independent perceivers tend to be characterized
by [...] relatively higher self-esteem, more effective
techniques for controlling anxiety, and 
a more differentiated body image. 
Note here that field dependence-field 
independence and adequacy of premorbid
adjustment are not related to the presence
or absence of schizophrenia.
Rather these data indicate that schizo- 
phrenia assumes different forms among 
people with relatively analytic, field- 
independent, and relatively diffuse, 
field-dependent orientations (Witkin et 
al., 1962). 

The differences in perceptual orien- 
tations of process and reactive schizo- 
phrenics also are consistent with the 
conceptualization that premorbid his- 
tory factors are related to genetic levels 
of perceptual development. In accord 
with the developmental levels hypothe- 
ses of Werner (1948) and Piaget 
(1954), the extreme perceptual orien- 
tations of process and reactive schizo- 
phrenics are interpreted as representing 
lower and higher levels, respectively,
of sensory, perceptual, and conceptual 
differentiation (Bryant, 1961). 


DEFENSIVE ASPECTS OF SCANNING
AND FIELD ARTICULATION 


Bleuler in his writings on the prob- 
lem of attention in schizophrenia made 
the following comments: 


If “enslavement by optical impressions” is 
a pathological overemphasis of the sense 
impressions, its opposite is that under other 
circumstances the patient ignores the outside 
world completely. 

Between these two extremes, all transi- 
tions are possible. . . . At the one extreme 
patients are easily distractible, seem to lack 
directives of their own and depend entirely 
on outer impressions; this is the case when 
the symptom of naming prevails; at the other 
extreme they are undistractible and even the 
strongest stimuli will fail to influence their 
train of ideas or arouse their attention. 

In their daily life the majority of chronic 
cases show little that is remarkable in this 
respect: they are at their work and attend 
to the new event insofar as it interests them. 
They can be interrupted in their talk. In 
more severe cases, however, the distractibil- 
ity is diminished. . . . 

Lack of distractibility may appear to 
spring from the patients’ lack of interest: as 
they care about nothing, nothing can influ- 
ence their behavior. But it can be shown 
that the same patients understand very well 
what goes on about them, even when they 
pay no active attention. In certain acute 
conditions distractibility is greatly reduced, 
often even to the vanishing point. [Rapa- 
port, 1951, p. 640]. 


Attention-response dispositions 
emerge in the course of physical and 
psychological development. Like phys- 
iological and psychological needs, they 
are directive; they exert a selective in- 
fluence on the cognitive field (Klein, 
1954). Unlike needs, however, they 
are not necessarily linked to a class of 
satisfaction-giving objects. Rather 
they are activated by and operate upon 
perceptual and conceptual incongruities 
(e.g., comprehending an ambiguous or 
complex stimulus field). Although 
conceptualized as learned behaviors,
they are not simply “attentional habits” 
which are correlated with certain quali- 
ties of stimuli. Rather they are con- 
ceived of as general response disposi- 
tions which may express themselves in 
broadly equivalent ranges of environ- 
mental situations. It should be noted 
that since the two attention control 
principles being discussed here may op- 
erate independently of one another, 
various combinations of the two con- 
trols are likely to occur in an individual 
(e.g., high scanning responsiveness-low
field articulation responsiveness, etc.).
The diagnostic and behavioral implications
of these various combinations of
controls for the schizophrenias are now
being investigated (Silverman, 1963b).

The concept of “defense” is ex- 
tremely useful when relating attention 
response patterns to modes of master- 
ing anxiety. Thus it is suggested that 
early in the development of a schizo- 
phrenic disorder, as isolations, repres- 
sions, and denials become less and less 
effective, the individual begins to rely 
on the most basic forms of adient- 
abient adjustive mechanisms. These 
mechanisms include formerly “conflict 
free” attention-response dispositions. 
In this respect, schizophrenics resemble 
very young children whose earliest de- 
fensive behaviors are ones which in- 
volve motivated attention. Solley and 
Murphy (1960) have suggested that 
among the earliest aspects of the en- 
vironment that are attended to are 
those complex stimulus sources neces- 
sary for biological survival. These 
stimulus sources include internally produced
and environmentally produced
anxiety cues which serve as discriminative
stimuli for the activation of attentional
responses. External psychologically
noxious cues which are consistently
coincident with attentional responses
also come to elicit these
response dispositions (excessive or
minimal) in their own right. Under
conditions of severe psychological 
stress, the characteristic mode of at- 
tention responsiveness utilized by the 
preschizophrenic is exaggerated and 
takes on a singularly defensive function.
That is, either via what Bleuler
referred to as “enslavement by optical 
impressions” or by “completely ignor- 
ing the outer world,” the schizophrenic 
gains extraordinary control over his 
perceptions. (As William James 
noted long ago, the mere reception of 
a stimulus is insufficient for perception 
since an individual’s experiences de- 
pend upon what he agrees to attend 
to.) 

Consider, for example, the schizo-
phrenic who relies on excessive scan- 
ning and who is constantly preoccupied 
with this orienting behavior. The de- 
velopment of this type of disposition is 
referable to a particular set of en-
vironmental contingencies in which ex- 
cessive scanning has consistently led to 
an attenuation in the intensity of 
anxiety. The excessive scanner has 
learned that his most effective means of 
escaping or avoiding anxiety is to be 
hyperalert to the presence of cues 
which often precede or co-occur with 
noxious events (Berlyne, 1960). How-
ever, while such behavior serves to
maximize the receipt of information
about impending “threatening” events,
it also has the paradoxical effect of in-
creasing the probability that sources of
aversive stimulation will be encoun-
tered. Under these conditions an ex-
cessive scanning disposition usually be-
comes coordinated with a propensity
for constant reinterpretations of rela-
tions between events and the motives of
others in a desperate attempt at pro-
tecting one’s self-esteem. 

Research by Margaret Singer and 
Lyman Wynne (1963; Wynne & 
Singer, 1963a, 1963b) and clinical ob- 
servations by Arieti (1955) suggest 
that an excessive scanning response
may be fostered rather early in life by
an unusual form of communication be-
tween parents and their preschizo-
phrenic children. Singer and Wynne
have observed that schizophrenics with
“fragmented” thought disorders, show-
ing the “cognitive alertness” of high
scanners, have parents with particular
kinds of communication problems. On
their Rorschachs and TATs these par- 
ents evidence tangential ideas, distract- 
ing comments, and rather extreme non 
sequiturs. Their TAT stories are re- 
plete with fragments of alternative 
meanings and alternative feelings, 
usually none of which is selected out 
and completed. Furthermore, they 
tend to build an attitudinal context into 
their communications, in which the 
sources of ideas appear to originate out- 
side themselves. In turn, their chil- 
dren respond in cryptic, paranoid-like 
ways, e.g., they unearth hidden mean- 
ings in events and objects, perceive in 
terms of odd vantage points, treat new 
stimuli as familiar, and often experi- 
ence their thoughts as not their own. 
In contrast to patients whose early his- 
tories indicate a poor-premorbid social 
adjustment and a different family com- 
munication pattern, schizophrenics evi- 
dencing a “fragmented” type of 
thought disorder usually have had dis- 
tinctly better premorbid social adjustments.

[...] attending and perceiving are apparent but
wherein there are serious failures in the
capacity to integrate ideation with impulse or
affect and events with their contexts. This
type of thinking disorder is found quite often
among paranoid schizophrenics. Schizophrenics
with “amorphous” forms of thought
disorder evidence predominately undifferentiated
attentional and perceptual responsiveness
(i.e., minimal field articulation).

Arieti (1955) comments on the
early social history of the excessive-scanning
paranoid schizophrenic as
follows:


The parents of the paranoid type do not 
criticize the patients for their actions; they 
generally accuse the patients for their inten- 
tions or for lying. The child learns to 
defend himself by anticipating these accusa- 
tions, and therefore becoming anxious and 
suspicious, or by developing a faculty for 
rationalizations. He has to find almost a 
legal or technical way to protect himself 
from insinuations and accusations [p. 139]. 
. Other cases instead of focusing their at- 
tention on their pseudorational defenses, seem 
to sense or magnify any kind of hostility in 
the environment [p. 140]. 


The defensive maneuvers of the pre- 
paranoid thus dispose him to constantly 
scan environmental input for possible 
threats to self-esteem and to deal with 
such threats by selectively examining 
and translating their meanings. 

Additional observations by Singer 
and Wynne (1963) and by Arieti
(1955) indicate that early social inter-
action factors may foster the develop- 
ment of minimal scanning and/or 
minimal field-articulation dispositions. 
The parents of schizophrenics with 
“amorphous” thought disorders com- 
municate with their offspring in an ex- 
ceptionally vague or pseudospecific 
manner. There is no effort at clarify-
ing obscure perceptual-associative re- 
sponses and the child’s emotionally dis- 
turbing experiences are dealt with in 
vague, global ways or not responded to 
at all. Thus environmental avoidance, 
blurred and uncertain perception, and
poorly organized ideas are all subtly re- 
inforced, whereas responses geared 
toward perceptual and conceptual dif- 
ferentiation most often result in in- 
creases in anxiety. 

In a discussion of the social history 
peculiarities of nonparanoid schizo- 
phrenics, Arieti (1955) suggests that 
more so than in other types of schizo- 
phrenia, catatonic schizophrenics, as 
children

are unable to accept their environmental
conditions, and at the same time they are
unable to fight them [p. 125].


Their break with reality occurs through
“withdrawal from action.” Such behavior
is apparently based upon a
low subjective probability estimate that
consensually valid overt actions reduce
or terminate anxiety.


The hebephrenic type occurs mostly under
two sets of circumstances: either when
arrest at a paranoid level does not [...]
anxiety or when the environmental factors do
not predispose to the development of paranoid
defenses [pp. 155-156].

Frequently in the development of the
simple type of schizophrenia, in addition to
poor child parent relationships the
patient has had to compete with a younger
sibling. . . . The patient abandons the
fight . . . and refuses to grow psychologically
[p. 156]. Rather than change reality,
the simple schizophrenic limits reality.
He narrows his horizons to a large extent
so that he will be able to make some compromise
with what is left of reality [Arieti,
1955, pp. 187-188].


Minimal-scanning and minimal field- 
articulating schizophrenics appear to 
avoid anxiety by directing their attention
away from the environment and
onto internal processes such as hallucinations.
Yet if this were completely
the case, such individuals should experience
even greater anxiety than that
which originally motivated their withdrawal.
That is, the chaotic nature of
a prolonged, self-induced sensory deprivation
state would operate to prevent
such schizophrenics from completely
removing themselves from their
environments. They must register
stimulation in order to live. Rapaport
(1960) cites evidence for the position
that the simplest everyday behaviors of
waking life are developed and maintained
only by “reaching out for stimulation.”
He supports his position by citing
results of the isolation experiments
by Bexton, Heron, and Scott (1954)
and others; the hypnosis studies by
Brenman and Gill (1947); and the
sensory disarrangement experiments by
Werner and Wapner (1955) and
others. Of particular interest are a
series of experiments by Held (e.g.,
Held & Hein, 1958) which indicate that 
our usual spatial orientations, however 
stable they may seem to us, are in the 
normal course of life brought about 
and maintained principally by “reaffer- 
ent stimulation.” It seems clear then, 
that what may appear in schizophrenia 
as the directing of attention away from 
the environment cannot be considered 
as a total shift of attention onto internal 
processes. Rather what appears to be 
involved are: (a) an anchoring of at- 
tention on dominant objects in the 
stimulus field (minimal scanning) and/ 
or (b) global and unarticulated atten- 
tiveness to sensory inputs. Such 
fixedly motivated attending operates 
selectively to minimize or “gate out” 
much of the perceptual and conceptual 
input from other sources. The neuro- 
physiological basis for such a gating 
mechanism is well documented in the 
research literature for both animals and 
humans (Berlyne, 1960, Ch. 3). 

Clinical observations by Arieti 
(1961, 1962), Rapaport (1951), and 
Bleuler (1951) support such a concep- 
tion of environmental withdrawal in 
schizophrenia. Arieti (1961) has fre- 
quently found that very regressed or 
very acutely ill patients perceive inte- 
grated stimulus patterns, not as unities, 
but rather in an extraordinarily unin- 
tegrated manner. 


For instance, a patient looking at the nurse 
could not see or focus on her as a person but 
perceived only her left or right eye or her 
hand or her nose, etc. Another patient who, 
while she was in a state of dangerous excite- 
ment was put in a seclusion cell, remembered 
that she could not look at the whole door of 
the cell. She could see only the knob or
[...] or some corner of the door
[p. 8].

Arieti also observed that with increases
in excitement, these perceptual sub- 
units divide into even smaller frag- 
ments. The disintegration of wholes 
phenomenon, termed “Awholism,” ap- 
parently is experienced by the schizo- 
phrenic as happening to him and he 
may even attempt to reconstruct his 
fragmented wholes into unities. When
this occurs the reconstructed wholes 
turn out to be unlike the original ones 
and a different type of perceptual dis- 
tortion results. 

Rapaport’s (1951) explanation of 
hypermnesia also lends itself to the 
position presented here. (Hypermnesia 
refers to the capacity of the schizo- 
phrenic to recall experiences as well or 
better than normals while making no 
differentiations between the salient and
[...] experiences). Rapaport states that
although attention
may be temporarily withdrawn from
reality, the schizophrenic invariably
makes a new attempt at contact with
his environment. Such new contact is
[...]

Similar behaviors have been recorded 
by Kraepelin (1919). 

The above clinical observations, as 
well as the researches presented earlier, 
indicate that the notion that attention is 
directed away from the environment in 
schizophrenia is in need of major reformulation.
The attention-response
formulation presented in this paper is
suggested as a general interpretative
statement of the clinical and experimental
literature on disturbances of attention
in schizophrenia. Thus Arieti’s
observation which he termed Awholism 
(the inability to perceive wholes) may 
be interpreted as being due to extreme 
minimal scanning or fixation of mo- 
mentarily dominant objects in the 
visual field, analogous to pathological
staring behavior observed in people 
who are in semistuporous conditions. 
Rapaport’s and Bleuler’s observations 
on adult schizophrenics lend themselves 
quite well to an explanation in terms of 
pathologically low field articulation 
such as was found among young chil- 
dren by Witkin and which usually be- 
comes less and less pronounced with 
age. Furthermore, it just does not 
make sense to say without specific 
qualification that hypermnesic people 
pay little attention to the world around 
them. If in fact one active perceptual 
event inhibits all other perceptual acts 
depending upon their respective degree 
of incompatibility (e.g., direction of 
the perceptual act), how can someone
both attend to the outside world “hypermnesically”
and still be said to live
in an autistic encapsulated world?
On the other hand, an explanation 
phrased in terms of exaggerated at- 
tention-response mechanisms has con- 
siderable explanatory power. These 
mechanisms are conceptualized in terms 
of an interaction of essentially two in- 
dependent processes, scanning and field 
articulation, which in particular (and 
as yet unknown) combinations predis- 
pose schizophrenics to structure their 
environments in different ways. 


A Schizophrenic Defense Mechanism 


At this point there still remains the 
task of tying together the attention-response
conceptualization with two important
developmental aspects of
schizophrenia: (a) the historical vari-
ables which have contributed to the 
“choice” of the schizophrenic resolution 
and (b) the effects of long-term im- 
mersion in the schizophrenic psycho- 
logical environment. In regard to the 
first of these issues, there is impressive 
evidence which suggests the importance 
of early experiences on the develop- 
ment of a particular kind of schizo- 
phrenia and of a particular attention 
response pattern. The observations of 
Singer and Wynne (1963) and Arieti 
(1955) concerning interpersonal rela- 
tionships and communication between 
parents and their schizophrenic chil- 
dren are clearly consistent with the for- 
mulation presented here. While fur- 
ther systematic research is needed, such 
observations do appear especially note- 
worthy. Additional indications of the 
importance of early social environment 
on attention responsiveness are found 
in the process-reactive literature which 
is saturated with developmental in- 
ferences and analyses (e.g., Becker,
1956, 1959; Fine & Zimet, 1959; Rod- 
nick & Garmezy, 1957). The field- 
differentiation hypotheses of Werner 
(1948) and Witkin et al. (1954, 1962) 
are most significant in this area (e.g.,
Bryant, 1961). Lastly, Piaget’s (1950) 
conception of the developmental as- 
pects of centration (scanning) has been 
found to be remarkably consistent with 
the process-reactive concept, at least in 
the early stages of schizophrenia (Harris,
1957; Zahn, 1959). However, the
apparent inapplicability of the centra- 
tion hypothesis to the attentional re- 
sponses of long-term chronic process 
and reactive patients is, at first glance, 
disconcerting, to say the least. Clearly, 
the generality of the attention-response
formulation is limited unless an explanation
can be found for those studies
in which chronic reactive patients evidence
minimal scanning responsiveness
as compared with chronic process pa- 
tients. Even more intriguing is the 
question of why the scanning behaviors 
of chronic process patients are ap- 
parently no different from those of non- 
schizophrenic individuals (Zahn, 1959). 
These findings necessitate a considera- 
tion of the second of the two develop- 
mental issues, namely what might be 
the differential effects of prolonged hos- 
pitalization on patients having different 
premorbid social histories and perhaps 
different symptom patterns. 

In order to deal with this latter 
problem, it will be necessary to postu- 
late an attention mechanism which will 
be termed “sensory input processing- 
ideational gating.” The data in sup- 
port of this formulation come mainly 
from some neurophysiological findings 
by DeVault (1955) and from a study 
of reaction to perceptual ambiguity by 
Draguns (1963). The clinical obser- 
vations of Bleuler on excessive regis- 
tration of sensory impressions, of 
Arieti on awholism, and of Rapaport 
on hypermnesia are also especially well 
suited to interpretation in terms of this 
mechanism. The neurophysiological 
data indicate an autonomic activation 
pattern among process-chronic schizo- 
phrenics in which response to sensory 
stimuli is present but in which response 
to ideational stimuli and to noxious 
physical stimuli is strongly inhibited 
(negative change in heart rate, de- 
creased GSR). Reactive-chronic 
schizophrenics show no autonomic in- 
hibition to nonnoxious ideational stim- 
uli. The perceptual ambiguity study 
indicates a preference among the more 
chronic schizophrenics to respond to 
the diffuse, undifferentiated aspects of 
stimuli rather than to their connotative 
attributes. On the basis of these findings,
to be described in more detail below,
the following formulation is presented:

1. Process-chronic schizophrenics
characteristically minimize or “gate 
out” disturbing ideational inputs, both 
perceptual and conceptual, while at 
the same time responding to sensory 
inputs. 

2. Reactive-chronic schizophrenics 
remain responsive to various ideational 
aspects of their environments as well as 
to sensory stimulation but they regress 
to a genetically earlier form of scanning
responsiveness.6

3. Under extraordinary stress con- 
ditions all schizophrenics evidence in- 
dications of ideational gating. 


DeVault (1955) studied the physio- 
logical response patterns (GSR, heart 
rate) of groups of chronic-process and 
chronic-reactive patients and of a non- 
schizophrenic group. Mean lengths of 
hospitalization for the two schizophrenic
groups were 8.8 years and 8.7
years, respectively. Response patterns
to a verbal warning followed (at least 
30 seconds later) by a loud bell and to 
scenes of hostility, sex, dependency, and 
a “neutral” scene, were found to be 
significantly different for the three 
groups. The reactive-chronic group 
evidenced a significantly higher physio- 
logical arousal pattern than the proc- 
ess-chronic group. In contrast to the 
response pattern of the process group 
to the pictorial stimuli, the pattern of 
the reactive and nonschizophrenic


6 Pearl (1962) has suggested that process 
schizophrenics, whom he equates with chronic
schizophrenics, are in a state of reduced 
stimulus input, whereas reactive schizo- 
phrenics, equated with acute schizophrenics, 
tend to be characterized by a state of stimu- 
lus overload. The general tenor of Pearl's 
hypothesis parallels the one presented here. 
However, note that the studies reviewed in 
this paper have indicated that the differentia- 
tion between process and reactive schizo- 
phrenia on the one hand and degree of 
chronicity on the other hand, which Pearl 
does not consider, is an extremely important 
distinction.

groups corresponded to one normally
observed when subjects are presented 
with ideational stimuli or noxious 
physical stimuli (Darrow, 1929; Jost, 
Ruilmann, Hill, & Gulo, 1952). The 
response pattern of the process group, 
however, resembled one not normally 
associated with reactions to ideational 
stimuli but rather one which is asso- 
ciated with responses to sensory stimuli 
(light, sound, etc.). This effect was 
noted in addition to the relative hypo- 
reactivity of the process patients as 
compared to reactives and nonschizo- 
phrenics. 

The hyporeactivity of the process 
group was not complete, however. The 
process schizophrenics did respond, but 
in a way different from that of the 
normals or reactives, with a decelera- 
tion (negative change) in heart rate 
after all the stimuli. This response, 
which according to Darrow is usually 
found after sensory stimuli, was often 
as large or larger than the positive 
changes in heart rate given by the nor- 
mals and reactives. DeVault (1955) 
points out that during the presentation 
of warning and bell combination, 


where a negative change might have been 
expected and was found to occur in the 
normal group, (but not in the reactive 
group) the negative change of the process 
group was significantly larger than that of 
the normals, giving an indication of a 
stronger than normal response on this oc- 
casion [pp. 50-51]. 


Autonomic inhibition was also ob- 
served on the GSR polygraph record- 
ings immediately after presentation of 
the verbal warning stimulus. Both 
schizophrenic groups evidenced a dim- 
inished GSR to the verbal warning 
when compared to the nonschizophre- 
nic group. Since diminished GSR 
usually occurs in response to strong 
sensory stimulation rather than to 
strong ideational stimulation, this latter 
finding may indicate that all schizophrenics
respond with “ideational gating”
when they encounter stimuli with
high “arousal power.” 

Draguns (1963) investigated the 
perceptual reactions of chronic and 
acute schizophrenics to ambiguous and 
sparse stimulus inputs. It was hy- 
pothesized that chronic patients as 
compared to acute schizophrenics and 
nonschizophrenics would demonstrate 
a strong disposition toward relatively 
premature and erroneous interpreta- 
tions of perceptual data. That is, the 
chronic group was expected to respond 
to the diffuse, physiognomic qualities 
of the stimuli. Note that this predic- 
tion implied a receptive attitude to the 
sensory aspects of the data being pre- 
sented by the experimenter. 

One of the techniques employed to 
test this hypothesis utilized realistic 
drawings of objects or scenes which 
were photographically blurred to vary- 
ing degrees. Nine series consisting 
of 12 pictures each were presented 
(beginning with the least clear presen- 
tation) to 34 chronic patients, 25 
acutes in their first month of their first 
hospitalization, and 36 hospital attend- 
ants. (Subjects were equated for age, 
sex, amount of schooling, and verbal 
intelligence.) Each individual was in- 
structed to respond at any point in a 
series, provided he “had an idea” of 
what the picture represented. Draguns 
(1963) found that the chronic group 
made their judgments at a significantly 
earlier point in time than did the non- 
schizophrenic group and that they 
made significantly more perceptual rec- 
ognition errors as well. The perform- 
ance of the acute group fell consistently 
between those of the chronic and non- 
schizophrenic groups. 

These results are interpreted here 
as indicating that when arousal is increased
experimentally due to the high
uncertainty value of the stimulus data 
being presented (Berlyne, 1960), the
schizophrenic will respond to the nonconnotative
aspects of the inputs. Furthermore,
this set will be taken despite
having the knowledge that adopting a
more deliberate perceptual response
orientation could, by itself, result in
gradual uncertainty reduction. Such
a “stylistic responsiveness to information
input [Draguns’ own term, p.
29]” is adopted in order (a) quickly
to reduce uncertainty and (b) to render
innocuous the potentially noxious
qualitative aspects of inputs from the 
environment. To deal with experi- 
ences in this manner enables the schizo- 
phrenic to filter threatening connota- 
tions, i.e., to respond to reality on a 
piecemeal basis “so as to guarantee 
that no relationships will be met which 
will arouse impulses and anxieties” 
(Rapaport, 1951).7 Sensory input 
processing-ideational gating is sug- 
gested as a mechanism through which 
such cognitive reorganizations take 
place in schizophrenia.8


7 The concept of the “filtering” of sensory
data has been discussed in detail by Broadbent
(1958). It has also been applied to the
re-evaluation of several studies of the effects
of certain drugs on the narrowing-broadening
of attention (Callaway & Band,
1958; Callaway & Dembo, 1958; Callaway &
Stone, 1960). In fact, the modifiability of
the scanning response via drugs suggests the
possibility of an experimental chemotherapy
program which could have important implications
for the treatment of schizophrenia.

8 The developmental and motivational bases 
for this mechanism are conceived to be: (a) 
initially, an acute psychological state of self- 
initiated sensory deprivation which McRey- 
nolds (1960) has postulated in his discussion 
of schizophrenic withdrawal, (b) a conse- 
quent reduction of sufficient sensory inputs in 
an organism who requires such inputs in 
order to exist, and (c) the gradual modification
of a self-initiated sensory-deprivation
“strategy” in terms of the processing of
sensory input but the simultaneous nonprocessing
of ideational input. The extent to
which this mechanism is utilized is apparently
dependent on premorbid history and clinical
syndrome factors.


Summary


The data and theories reviewed indicate
the following important relationships
between the schizophrenic process
and attention-response dispositions:


1. Extreme scanning and/or field- 
articulation perceptual dispositions 
characterize the attention response 
styles of most schizophrenics. Assessment
of these response dispositions can
be made by determining the constant 
error patterns of an individual's per- 
formances on a variety of psychophysi- 
cal laboratory tasks. 

2. Within the Kraepelinian categori- 
zation system, extensive scanning and 
field-articulation responsiveness are as- 
sociated with the paranoid subtype, 
whereas minimal scanning and undif- 
ferentiated field-articulation respon- 
siveness are associated with the cata- 
tonic, simple, and hebephrenic subtypes. 

3. A moderate or nonextreme atten- 
tion deployment response pattern in 
schizophrenia is associated with a good 
prognosis. 

4. Within the process-reactive clas- 
sification, undifferentiated field-articu- 
lation responsiveness is characteristic 
of process schizophrenics with poor 
premorbid developmental histories. 
Reactive schizophrenics evidence a 
more highly differentiated field-articu- 
lation responsiveness. These distinc- 
tions are consistent with the “genetic 
levels of cognitive development” hy- 
pothesis set forth by Werner, Piaget, 
and others. 

5. The significance of the scanning 
response disposition in either of its 
extreme forms in schizophrenia derives 
on the one hand from its relationship 
to personality structure and on the 
other hand through its use in the analy- 
sis of basic perceptual orientations to 
the environment. 

6. Long-term involvement in the 
schizophrenic resolution appears to be
associated with change in attention response
style. The kind of change is
dependent upon the particular “style” 
which was operative during the pre- 
morbid and early phases of the schizo- 
phrenia. 

7. The extensive utilization of an 
“ideational gating” mechanism which 
filters disturbing connotative environ- 
mental inputs is postulated as the ulti- 
mate schizophrenic defense mechanism. 

8. The interactions between such 
important variables as the process- 
reactive distinction, physiological reac- 
tivity, degree of chronicity, anxiety 
levels, and Kraepelinian subtype are 
complex, to say the least. However, 
these interactions can be systematically 
explored within the attention response 
style model.


A POSSIBLE OPTICAL BASIS FOR MONOCULAR
SLANT PERCEPTION 1


Optic variables that correspond to various properties of distal substances
and define the axial coordinates and the slant of a surface are isolated in 
the light to the eye. The textural elements composing a surface are 
assumed to be randomly irregular in shape, size, and separation; and to 
have certain light-reflecting properties; but no prior assumptions are 
made about the position and illumination of the surface. The accurate 
specification of distal surface slant from optic variables is demonstrated 
both when the head of the viewer and the surface are motionless and 
when the surface is moving. Testable hypotheses that are derived from 
the theoretical model are mentioned at various places. Presumptive 
abilities necessary to the registration of the optic variables are also
described.


Theoretically, accurate monocular 
slant perceptions are possible even 
though a motionless viewer has no 
previous experience with a particular 
substance, even though the textural 
elements of a motionless surface are 
irregular in size, shape, and separa- 
tion, and even though parts of the 
surface are illuminated differently. 
This paper attempts to demonstrate 
the truth of this proposition, basing 
the demonstration on two hypothetical 
optic variables—the optical invari- 
ance of congruent luminous patterns, 
and optical n. The hypothetical 
basis for these two variables is shown 
to depend on two postulates about 
nature. To preclude covert intro- 
duction of unwanted assumptions, 
the abilities presumably needed to 
register the optic variables are out- 
lined at each stage of the paper. 


OPTICAL TEXTURE 


Every point (station point) in and 
above terrestrial space is partly sur- 
rounded by surfaces. The light 


1 This paper was supported in part by Public
Health Service Research Grant, MH
06942-01, from the National Institute of 
Mental Health. 


reflected from these surfaces travels 
in straight pathways and converges 
on individual station points as a solid 
spherical bundle of light rays. Gibson 
(1961) calls this bundle of light rays 
the optic array. Because of different 
incident lights falling on surfaces and 
because of the differential reflectances 
of substances, the light transmitted 
along different pathways in an optic 
array has different intensities and
spectral values. The optic array
therefore possesses a texture. This
optical texture (see also Gibson, 1950a,
1950b, 1961) may be considered as a
luminous pattern of relative intensities.
It may be considered also as a
single pattern or as subdivided into 
many individual patterns. 


OPTICAL INVARIANCE OF CONGRUENT 
LUMINOUS PATTERNS


Consider the possibility (Postulate 
I) that substances (groups of substances,
substantive structures, or
substructures) of a particular class,
possessing tangible regularities (e.g.,
congruencies and similarities) in layout
and structure, have unique light-reflecting
properties and reflect a
unique pattern of relative light intensities
invariant to that class. As a
corollary, substances of a different 
class reflect a different but equally 
unique pattern of relative light 
intensities. 

Postulate I implies that the optical
texture possesses spatially extended 
congruent luminous patterns that 
uniquely refer to substances of the 
same class. If it can be shown that 
the congruent luminous patterns tend 
to remain unchanged despite changes 
in the size, shape, position, and illumi- 
nation of the distal substances, then 
Postulate I implies a concept of 
optical invariance. 

Consider a collection of like sub- 
stances—the grass blades in a lawn, 
for example. If two grass blades are 
identical in size and shape, are 
illuminated in exactly the same way, 
and are at the same distance from and 
slant to a station point, then both 
(according to Postulate I) should 
reflect congruent patterns of relative 
luminous intensities to the station 
point. Scanning one of these lumi- 
nous patterns in steps of 30 seconds 
of arc, one might find the sequential 
spatial order of absolute luminous 
intensities to have the following 
(fictional) values: 


60, 15, 20, 35, 60, 15, 20, 35, 60 . . . [1]


The pattern of relative changes in 
the light can then be described as: 


4, 1, 1.3, 2.3, 4, 1, 1.3, 2.3, 4 . . . [2]


This luminous pattern of relative
intensities refers to some part of the 
total luminous pattern reflected by a 
substance like a grass blade. If this 
luminous pattern of relative intensi- 
ties is optically invariant, it must 
remain unchanged despite changes in 
size, shape, position, and illumination
of the substance containing its distal
referents. 

Changing Incident Light. If the
illumination on one blade is increased
uniformly until the reflected absolute
intensities at the station point are
doubled (i.e., 120, 30, 40, 70, 120 . . .),
the pattern of relative intensities 
remains invariant as in Sequence 2 
above. 
Changing Distance. When two grass 
blades are presented as before, but 
with one at a greater distance from 
the station point, the luminous pat- 
terns have the same relative intensi- 
ties. The absolute intensities differ, 
of course, as a function of the inverse 
square law. 
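
The invariance claimed in the two preceding cases can be checked numerically. The following Python sketch is illustrative only: the function name relative_pattern and the convention of expressing each intensity as a multiple of the weakest one are assumptions, chosen because they reproduce the fictional values of Sequences 1 and 2. The paper itself does not prescribe a particular normalization.

```python
def relative_pattern(intensities, ndigits=1):
    """Express each absolute intensity as a multiple of the weakest one.

    Assumption: the weakest intensity serves as the reference value.
    """
    weakest = min(intensities)
    return [round(value / weakest, ndigits) for value in intensities]


# Fictional absolute intensities from Sequence 1 (one full cycle).
sequence_1 = [60, 15, 20, 35, 60]

# Doubling the incident light doubles every reflected intensity.
doubled = [2 * value for value in sequence_1]

# Doubling the viewing distance divides every intensity by four
# (inverse square law).
farther = [value / 4 for value in sequence_1]

base = relative_pattern(sequence_1)
print(base)                       # [4.0, 1.0, 1.3, 2.3, 4.0]
print(relative_pattern(doubled))  # same relative pattern
print(relative_pattern(farther))  # same relative pattern
```

Any uniform multiplication of the absolute intensities cancels in the ratios, which is why the relative pattern survives both changes.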

Magnification. Matter supposedly 
has almost infinitely fine structure. 
As one of the grass blades is brought 
closer to a station point, the angular 
sizes of its fine structure increase. In 
scanning part of a grass blade from a 
station point in steps of 30 seconds of 
arc, the absolute reflected intensities 
of light that are discriminable might 
be as in Sequence 1 above, and the 
relative intensities as in Sequence 2 
above. At one half the distance, the 
absolute intensities for the same rays 
are: 240, 60, 80, 140, 240 . . . Because
the angular separations of these
light rays are approximately 60 sec- 
onds of arc, the new readings in steps 
of 30 seconds of arc might be: 240, 
160, 60, 160, 80, 160, 140, 160, 240... 
Correspondingly, at one fourth the 
distance the readings might be: 960, 
360, 640, 840, 240, 360, 640, 840, 320, 
360, 640, 840, 560, 360, 640, 840, 
960 . . . The relative intensities are
then: 4, 1.5, 2.7, 3.5, 1, 1.5, 2.7, 3.5,
1.3, 1.5, 2.7, 3.5, 2.3, 1.5, 2.7, 3.5,
4 . . .


At one fourth the distance a repeti- 
tive subcycle, 1.5, 2.7, 3.5, emerges in 
the optical texture. This subcycle 
could not be observed at the most
extreme distance where the cycle 4,
1, 1.3, 2.3 alone was registered. At 
one fourth the distance, three classes 
of congruent luminous patterns can 
be identified, viz., 4, 1, 1.3, 2.3; 1.5, 
2.7, 3.5; and the combination of these. 
Presumably, each of these classes of 
congruent luminous patterns refers to 
a unique level of substance. (It will 
be shown later that any one of these 
classes of luminous patterns can be 
used to specify the slant of the sur- 
faces in which their distal referents 
are found.) 
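
The emergence of the subcycle at one fourth the distance can be illustrated with the fictional readings given above. In this sketch the separation of coarse and fine readings by list position, and the names coarse and fine, are assumptions made for illustration. The reference intensity is taken to be the weakest reading of the coarse cycle, which reproduces the relative values in the text.

```python
# Fictional readings at one fourth the distance, in steps of 30 seconds of arc.
quarter = [960, 360, 640, 840, 240, 360, 640, 840,
           320, 360, 640, 840, 560, 360, 640, 840, 960]

# Every fourth reading belongs to the coarse cycle seen at the greatest distance.
coarse = quarter[::4]

# The three readings between successive coarse readings form the subcycle.
fine = [quarter[i:i + 3] for i in range(1, len(quarter), 4)]

# Assumed reference: the weakest reading of the coarse cycle.
reference = min(coarse)

coarse_rel = [round(v / reference, 1) for v in coarse]
fine_rel = [[round(v / reference, 1) for v in sub] for sub in fine]

print(coarse_rel)  # [4.0, 1.0, 1.3, 2.3, 4.0]
print(fine_rel)    # four repetitions of [1.5, 2.7, 3.5]
```

The coarse cycle is unchanged from Sequence 2, while the repeated triple appears only because the finer structure has become angularly large enough to be sampled.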

Changing Size and Shape. Chang- 
ing the size or shape of a grass blade 
might affect some aspects of a lumi- 
nous pattern, but the pattern of the 
subcycles should remain the same. 

Gradations of Illumination. If il- 
lumination varies over each grass 
blade, luminous patterns for grass 
blades are not congruent. Neverthe- 
less, small regions of a grass blade may 
remain under approximately constant 
illumination, although the level of 
illumination differs from region to 
region. In this event, congruent 
luminous patterns again can be ob- 
served in the optical texture: the 
congruent luminous patterns might 
correspond to the ribs of a grass blade 
(or of a group of grass blades), for 
example, instead of to a more molar 
level of structure. 

Slanting a Surface. Two bits of
identical matter may be illuminated 
in the same way and have the same 
light-reflecting properties, but because 
they lie in the slanted surface of some 
substance, they may be at different 
distances from a station point. In 
that event, the intensities of their 
reflected lights at the station point 
differ as a function of the inverse 
square law. These differences in ab- 
solute intensities are relatively trivial, 
however, for identical bits that lie 
within small regions on the surface.

Registration of the Optical Invariant.
This section has set forth a concept 
of optical invariance for substances of 
the same class. When Postulate I
holds true, the luminous patterns 
reflected by like substances tend to be 
congruent and unchanged despite 
changes in the position, size, shape, 
and illumination of the distal sub- 
stances. 

If this optical invariant is to be 
perceptually useful, visual systems 
must have the abilities necessary to 
register it. These include the ability 
to: (a) register relative changes and 
patterns of relative changes in luminous
intensity; and (b) determine that
one pattern of relative changes is the 
same as, or different from, a second 
pattern. 


LINEAR REGULARITIES IN PHYSICAL 
SUBSTANCE 


Congruent luminous patterns identify
a distal collection of what are
called hereafter like elements. Like
elements possess the same substantive 
structure but are not necessarily the 
same size and shape. In addition, 
adjacent like elements are not necessarily
separated by the same distances.
Although like elements may
be irregular in size, shape, and separa- 
tion, adjacent collections of like ele- 
ments and their interspaces may be 
relatively uniform in size and shape. 
The truth of this proposition depends 
on a second assumption about nature 
(Postulate II). Although like elements
(grass blades in a lawn, or the
ribs of all grass blades in a lawn, etc.)
vary in size, shape, and separation,
such variations over a limited terrestrial
region are normally distributed,2
are random, and have a


2 It is not clear that the assumption of
normality is necessary: rectilinearity of
distribution may be sufficient.




finite variance. This section shows 
how adjacent collections of like ele- 
ments and their interspaces can be 
relatively uniform in size (see also 
Purdy, 1958). 

Consider a lawn where the soil has 
relatively equal nutrients. Individual 
grass blades3 probably differ in width
and are separated by different distances.
But, in accordance with
Postulate II, these variations are
random, are normally distributed,
and have a finite variance. Measuring
the widths and separations,
one might find the combined mean 
width and separation of all grass 
blades to be .15-inch, with a standard 
deviation of .022-inch. This popula- 
tion of widths and separations could 
be sampled by stretching a wire in a 
straight line across a part of the lawn 
until it crossed, for example, nine 
grass blades and their separations. 
This procedure could be repeated at 
100 other places on the lawn. Based 
on the population values, 95/100 of 
those linear extents would lie between 
approximately 1.22 and 1.48-inch, 
with a relative error of 9.6%. These 
limits are given by the following 
equation: 


$$\text{Limits} = n\left(M_{pop} \pm 1.96\,\sigma/\sqrt{n}\right) \qquad [3]$$


In this equation n is the numerosity
of like elements in a collection; $M_{pop}$
is the mean size and separation of the
population of like elements; $\sigma$ is the
standard deviation of the population
of sizes and separations; and 1.96 is a
constant.
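
As a check on the arithmetic, Equation 3 can be evaluated for the fictional lawn values (nine grass blades, a mean of .15 inch, and a standard deviation of .022 inch). The function name collection_limits is hypothetical; the sketch simply restates Equation 3.

```python
import math


def collection_limits(n, mean, sd, z=1.96):
    """Lower and upper limits (Equation 3) for the extent of n like elements."""
    half_width = z * sd / math.sqrt(n)
    return n * (mean - half_width), n * (mean + half_width)


# Values from the lawn example: widths and separations in inches.
lower, upper = collection_limits(n=9, mean=0.15, sd=0.022)
print(round(lower, 2), round(upper, 2))  # 1.22 1.48
```

The result agrees with the limits of approximately 1.22 and 1.48 inch stated in the text.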

The relative error is given by the
term: $(L_1 - L_2)/(L_1 + L_2)$, where
$L_1$ and $L_2$ are the upper and lower
limits, respectively. When the rela-


3 Although I have based the analysis in this
paper on grass blades that form a flat surface,
the analysis could have been based on any
type of distinguishable substantive struc-
ures.

tive error is small, collections of n
like elements are relatively regular in 
size. Relative error, therefore, is a 
measure of regularity and can be 
replaced by an equivalent expression 
that specifies the degree of regularity 
of a collection of like elements: 


$$\text{Degree of regularity} = 1.0 - 1.96\,s/(M\sqrt{n}) \qquad [4]$$


In Equation 4, s is the estimated
standard deviation of the population,
based on the sample variance; M is
the mean of the sample; and n is the
number of elements in the sample.


If the sampling procedure described above is
repeated for 36 instead of 9 widths
and separations, the degree of regularity
(using Equation 4) becomes
.952 instead of .904. Increasing the
value of n thus increases the degree of
regularity. The degree of regularity
can also be increased
by making s smaller (i.e., by making
the elements more uniform in size,
shape, and separation).
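
Equation 4 can be evaluated in the same way. The sketch below is a hedged illustration: the function names are hypothetical, and the population values of the lawn example (.15 and .022) stand in for the sample estimates M and s. It also confirms that, for these values, the degree of regularity equals 1.0 minus the relative error.

```python
import math


def degree_of_regularity(n, mean, sd, z=1.96):
    """Equation 4: 1.0 - 1.96 s / (M * sqrt(n))."""
    return 1.0 - z * sd / (mean * math.sqrt(n))


def relative_error(n, mean, sd, z=1.96):
    """(L1 - L2) / (L1 + L2), with the limits taken from Equation 3."""
    half_width = z * sd / math.sqrt(n)
    upper = n * (mean + half_width)
    lower = n * (mean - half_width)
    return (upper - lower) / (upper + lower)


for n in (9, 36):
    regularity = degree_of_regularity(n, mean=0.15, sd=0.022)
    error = relative_error(n, mean=0.15, sd=0.022)
    print(n, round(regularity, 3), round(error, 3))
```

Quadrupling n halves the relative error, since the error varies inversely with the square root of n.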


OPTICAL n


In Equation 4 the degree of regu- 
larity depends on three variables. 
Two of those variables, M and s,
are indigenous to the particular 
matter, whereas the third variable, 
n, may be given any arbitrary value. 
What value n is given depends on the 
degree of regularity that is desired. 
The choice of a value for n is to be 
specified in accordance with optic 
criteria—the method of determina- 
tion is described in the next section. 
But first an optic variable is described 
that corresponds to collections of n
like elements.

Consider again the lawn described
above. At a station point, the optical
texture is packed with congruent
luminous patterns, each corresponding 
to a grass blade. Select any luminous 
point (small luminous spot) in the 
optical texture, calling it the optic 
fixation point. With a line called the 
optic line of regard, connect the optic 
fixation point to the center of the 
optic array. Construct a meridian 
called the optic meridian through the 
optic fixation point. Along the merid- 
ian, count off a number, n, of contig- 
uous congruent luminous patterns, 
calling this unit, optical n. Continue 
counting off such units until the merid- 
ian in the vicinity of the optic fixation 
point is subdivided into numerically 
equal units of n. These constructions 
are represented below in Fig. 1. 

The numerosity of n in optical n
corresponds exactly to the numerosity
of n in the corresponding collection of
n like elements. Whereas several
collections of n like elements are
relatively regular in size, however, 
the angular sizes of the optic units 
are almost never so. Only in those 
special instances when several col- 
lections of like elements are at the 
same distance from the station point 
are the optic units angularly uniform. 


VALUE OF OPTICAL n 


In order to specify accurately the 
slant of a distal surface from optic 
variables, optical n must be given a 
proper numerical value. That value is 
shown to depend on optic conditions 
which satisfy four criteria. 

The optic meridian in the vicinity 
of the optic fixation point has been 
subdivided into numerically equal 
units of n. The angular sizes of the 
optic units may tend then to increase 
gradually along a segment of the 
meridian, to reach a maximum at the 
optic fixation point, and to decrease 
symmetrically in angular size. If
these bilateral angular symmetries
do not occur along the meridian, the
meridian is to be rotated on the optic 
fixation point until they do occur. 
Whenever like elements lie in a plane 
and conform to Postulates I and II,
bilateral symmetries may be observed
along just one optic meridian through
each optic fixation point.4
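
The bilateral symmetry described above can be illustrated geometrically for the simplest case. The sketch assumes, for illustration only, that collections of like elements have exactly equal linear extent (1.35 inches, the mean extent of nine grass blades in the lawn example), that they lie along a line in the surface perpendicular to the line of regard, and that the station point is 60 inches from the surface along the line of regard. The distance and the function name angular_sizes are hypothetical and do not come from the paper.

```python
import math


def angular_sizes(unit_width, distance, units_each_side):
    """Angular size (degrees) of equal linear units along a line that is
    perpendicular to the line of regard, with one unit centered on the
    point where the line of regard meets the surface."""
    sizes = []
    for k in range(-units_each_side, units_each_side + 1):
        left_edge = (k - 0.5) * unit_width
        right_edge = (k + 0.5) * unit_width
        angle = math.atan(right_edge / distance) - math.atan(left_edge / distance)
        sizes.append(math.degrees(angle))
    return sizes


sizes = angular_sizes(unit_width=1.35, distance=60.0, units_each_side=4)
center = len(sizes) // 2
for offset, size in enumerate(sizes, start=-center):
    print(offset, round(size, 4))
```

Under these assumptions the angular sizes rise to a maximum at the central unit and fall off symmetrically on either side, which is the pattern the normal meridian is expected to show.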

For some numerical value of optical 
n, the series of bilateral angular 
changes along the meridian may tend 
to reach a maximum at the optic 
fixation point, but the series may fail 
to have perfect symmetries. There 
may be reversals in the series. There 
may be bilateral asymmetries in the 
series, although there are no reversals. 
Nevertheless, the proper value of 
optical n is assumed to be that value
of n that satisfies the following four
criteria and correspondingly identifies
relatively regular units of n like
elements:


I. The series of gradually increas- 
ing, bilateral angular sizes along the 
optic meridian tends to reach a 
maximum at the optic fixation point. 

II. The reversals and asymmetries 
in the series of angular sizes are 
distributed at random throughout the 
total series.

III. The numerical size of n is 
inversely proportional to the observed 
number of reversals and asymmetries 
in the total series. 

IV. Other meridians parallel and 
close to the optic meridian yield the 
same value for n as does the optic 
meridian. 
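
One way Criteria I to III might be made operational is sketched below. The definitions are assumptions, not the paper's: a reversal is counted whenever successive angular sizes change in the direction opposite to the expected rise toward, or fall away from, the optic fixation point, and an asymmetry is counted whenever two units equally far from the fixation point differ by more than a chosen tolerance. The sample series are invented.

```python
def count_reversals(sizes, peak_index):
    """Count steps that run against the expected rise-then-fall order."""
    reversals = 0
    for i in range(1, len(sizes)):
        step = sizes[i] - sizes[i - 1]
        rising_expected = i <= peak_index
        if rising_expected and step < 0:
            reversals += 1
        elif not rising_expected and step > 0:
            reversals += 1
    return reversals


def count_asymmetries(sizes, peak_index, tolerance):
    """Count mirror-image pairs around the peak that differ by more than tolerance."""
    pairs = min(peak_index, len(sizes) - 1 - peak_index)
    return sum(
        1
        for offset in range(1, pairs + 1)
        if abs(sizes[peak_index - offset] - sizes[peak_index + offset]) > tolerance
    )


# Invented angular sizes (arbitrary units); index 2 is the optic fixation point.
ideal = [1.0, 1.2, 1.4, 1.2, 1.0]
noisy = [1.1, 1.0, 1.4, 1.3, 1.35]

print(count_reversals(ideal, 2), count_asymmetries(ideal, 2, 0.05))  # 0 0
print(count_reversals(noisy, 2), count_asymmetries(noisy, 2, 0.05))  # 2 2
```

Counts of this kind could be tallied for successive trial values of n; under Criterion III the counts would be expected to fall as n grows.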


In those instances where Postulates
I and II fail to describe nature accurately,
a value of n may satisfy
Criteria I to IV but the corresponding
units of n like elements may not be 


4 There is an exception: when the optic line
of regard, extended to intersect the surface,
is normal to the surface, all optic meridians
through that optic fixation point will yield
these symmetries.


FIG. 1. Section of the optical texture showing optic units of n congruent
luminous patterns and corresponding units of n like elements.


regular. In those instances illusory
perceptions might occur (e.g., Gibson
& Flock, 1962). Any conditions that
actually prevent or decrease the
probability of satisfying the criteria
should yield an improper value for n
and, as a consequence, perceptual
errors. Those conditions are a basis,
therefore, for empirically testing this
model and have corresponding im- 
portance. For example, satisfying 
Criteria I to IV depends on scanning a 
specific number, N, of collections of 
n like elements. Under conditions 
that satisfy the criteria, therefore, 
there are Nn like elements repre- 
sented in the optical texture. If 
everything else remains equal but s 
in Equation 4 is increased in magni- 
tude, Nn like elements would be 
numerically inadequate for yielding 
a proper value for optical n. 

Various viewing conditions are also 
theoretically crucial to optical n.
First, when the viewing distance is 
too great, many like elements may not 
be liminal for the scanning instru- 
ment. As a result, the corresponding 
congruent luminous patterns that are 
liminal may be numerically inade- 
quate. Second, a population of like 
elements may be optically liminal, 
but the view of them may be angu- 
larly too restricted. If the field of 
view is too small the number of like 
elements represented in the optical 
texture may be too few. There is, 
therefore, an interdependence between 
variables like s and n, and viewing 
conditions. 

The problem of numerical adequacy 
is complicated further by an addi- 
tional limitation of scanning instru- 
ments like the human eye. An optical 
texture may be densely packed with 
liminal congruent luminous patterns. 
The corresponding like elements may 
even be regular in size, shape, and 
separation. Despite these seemingly 
optimal conditions for giving n a 
proper value, it may be impossible 
to do so. Over the optical texture or 
part of the optical texture the ratio 
of the largest to the smallest angular 
size of optical n may be greater than 
but close to 1.0. This can occur when 
an angular field of view is very small.
Under these conditions a scanning
instrument may register all angular
sizes as the same and fail to attain
Criteria I to IV. Increasing the field
of view, therefore, increases the size
of the ratio and correspondingly
increases the probability of registering
changes in the angular size of optical n.
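
The dependence of this ratio on the field of view can be illustrated numerically. The sketch below is only an illustration under stated assumptions: a flat surface slanted by a known angle about its normal axis, a like element of fixed width lying parallel to that axis, and a field of view extending equally above and below the fixation point along the great-circle arc. The function names, the viewing distance, and the element width are hypothetical and are not part of the model as stated.

```python
import math


def angular_size(slant_deg, offset_deg, width=1.0, distance=10.0):
    """Angular size (radians) of an element of the given width seen at
    offset_deg from the line of regard, on a flat surface with the given slant.
    Geometry is an assumption made for this illustration."""
    slant = math.radians(slant_deg)
    offset = math.radians(offset_deg)
    # Distance from the station point to the surface along the ray at this offset.
    ray = distance * math.cos(slant) / math.cos(slant + offset)
    return math.atan(width / ray)


def size_ratio(slant_deg, half_field_deg):
    """Ratio of the largest to the smallest angular size at the two edges
    of a field of view spanning +/- half_field_deg."""
    upper = angular_size(slant_deg, +half_field_deg)
    lower = angular_size(slant_deg, -half_field_deg)
    return max(upper, lower) / min(upper, lower)


# The ratio approaches 1.0 as the field of view shrinks.
for half_field in (0.5, 2.0, 10.0, 20.0):
    print(half_field, round(size_ratio(40.0, half_field), 3))
```

With a very small field the ratio is only slightly above 1.0, so an instrument with limited resolution for differences in angular size could register all sizes as equal; widening the field raises the ratio.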


OPTIC VARIABLES FOR THE
COORDINATE AXES OF
A DISTAL SURFACE


Giving a proper value to optical n
does not assume that a collection of
like elements physically forms a
plane. It assumes only that some
plane (hereafter called a surface) can
be passed through the like elements.
The coordinate axes of that surface
are defined by three optic variables:
the normal meridian, the great-circle
arc, and the optic line of regard.

Normal Meridian. Giving a value to
optical n may require repeated rota-
tions of the optic meridian at the
optic fixation point. The optic merid-
ian along which the angular sizes of
optical n are bilaterally symmetrical
is called the normal meridian.

The extended optic line of regard,
called the line of regard, intersects the
surface at a point called the fixation
point. In this context the center of
the optic array is called the station
point. A plane (the plane of regard)
passed through the optic line of regard
and the normal meridian, when ex-
tended, intersects the surface on a
line called the normal axis of the
surface. The normal axis is per-
pendicular to the line of regard and
projects the normal meridian.

Great-Circle Arc. Optical n reaches
an angular maximum on meridians
parallel and close to the normal
meridian (see Criterion IV above). A
great-circle arc traced through the
optic maxima is normal to the parallel
meridians and passes through the
optic fixation point. A plane passed
through the optic line of regard and 
the great-circle arc, called the plane 
of the slant, when extended, intersects 
the surface on a line called the line of 
rotation. This plane forms a plane 
dihedral angle of 90° with the surface. 
Within this plane a line through the 
fixation point and perpendicular to 
the line of regard is the vertical axis 
of the surface. The vertical axis 
projects the great-circle arc. 

When extended, the optic line of 
regard corresponds to the depth axis 
of the surface. These constructions 
are shown in Fig. 2. 


OPTICAL THETA


Each optic term in the preceding sections describes some characteristic of a distal collection of elements. First, the optical invariant (congruent luminous patterns) distinguishes substantive classes (like elements) in the material world. Second, optical n specifies regularities among randomly irregular like elements. Third, the normal meridian, great-circle arc, and optic line of regard define the coordinate axes of a surface passed through a collection of like elements.

Fig. 2. Constructions defining the coordinate axes and slant angle, θ, of a surface.

The slant angle, θ, of the surface (the angle to which the surface has been rotated on its normal axis out of the frontal parallel plane) is measured by the angle included between the line of rotation and the vertical axis of the surface.* The optic variable that specifies the slant of the surface is optical theta.

* An alternate way to specify the angle of slant of the surface is to take the angle between the line of regard and the line of rotation. That angle is always equal to the angle of slant described above, plus 90°. I have employed the vertical axis of the surface instead of the line of regard as the initial side of the angle of slant in deference to psychological usage, which seems to prefer the frontal position as 0° slant rather than taking the plane of the line of regard as 0° slant.

Fig. 3. Optic angles, α, γ, ε, and equal angles, β, for specifying optical theta.

Derivation of Optical Theta. Select any two luminous points (spots) lying in the great-circle arc, making their angular midpoint the optic fixation point. This yields two equal angles, beta, centered on three optic points. From the three optic points, trace out along their respective optic meridians the angles α, γ, ε, until each angle subtends one optic unit (see Fig. 3).


The slant angle, θ, of the surface is given by an optic expression, optical theta, involving two ratios of the angles α, γ, ε. This optic expression⁶ is:

$$\theta_{opt} = \cos^{-1}\sqrt{\frac{4K^{2} - (1 + K')^{2}}{4\,(K^{2} - K')}} \qquad [5]$$

where:

$$K = \frac{\cot\alpha}{\cot\gamma}, \qquad K' = \frac{\cot\alpha}{\cot\epsilon}$$

When angles α, γ, ε are less than 30°, optical theta is hardly changed by substituting the ratios γ/α and ε/α for K and K', respectively. When the surface is flat, optical theta remains unchanged if the angles beta are taken increasingly larger and if angles α, γ, ε subtend more than one optic unit. When the surface is curved, this process yields changing values for optical theta. Here, then, is a criterion for distinguishing flat from curved surfaces.

⁶ The derivation of the following expression is given in Flock (1962), Appendix A.
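
A numerical check of Equation 5 may help fix ideas. The sketch below is not the derivation in Flock (1962); it reads Equation 5 as θ = arccos √[(4K² − (1 + K')²) / (4(K² − K'))] with K = cot α / cot γ and K' = cot α / cot ε, and it assumes that γ is the angle traced at the fixation point while α and ε are traced at the two flanking points. The helper `simulate_angles`, the viewing distance, and the unit size are hypothetical and serve only to generate angles from a flat surface of known slant.

```python
import math


def optical_theta(alpha, gamma, epsilon, small_angle=False):
    """Slant in degrees from three optic angles given in radians.

    Uses Equation 5 as read here. With small_angle=True the ratios
    gamma/alpha and epsilon/alpha replace K and K'.
    """
    if small_angle:
        k, k_prime = gamma / alpha, epsilon / alpha
    else:
        # cot(a) / cot(g) is the same quantity as tan(g) / tan(a).
        k = math.tan(gamma) / math.tan(alpha)
        k_prime = math.tan(epsilon) / math.tan(alpha)
    ratio = (4 * k**2 - (1 + k_prime) ** 2) / (4 * (k**2 - k_prime))
    ratio = min(1.0, max(0.0, ratio))  # guard against rounding just outside [0, 1]
    return math.degrees(math.acos(math.sqrt(ratio)))


def simulate_angles(slant_deg, beta_deg, unit=1.0, distance=10.0):
    """Hypothetical helper: angles (alpha, gamma, epsilon) subtended by one
    optic unit at +beta, 0, and -beta on a flat surface of known slant."""
    slant, beta = math.radians(slant_deg), math.radians(beta_deg)

    def size(offset):
        ray = distance * math.cos(slant) / math.cos(slant + offset)
        return math.atan(unit / ray)

    return size(beta), size(0.0), size(-beta)


alpha, gamma, epsilon = simulate_angles(40.0, 10.0)
print(optical_theta(alpha, gamma, epsilon))
print(optical_theta(alpha, gamma, epsilon, small_angle=True))
```

Under these assumptions the expression recovers the simulated slant for any choice of beta on a flat surface, which is the flat-surface invariance the text uses as a criterion. The expression is undefined when beta is zero, because K² then equals K'.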

Registration of Optical Theta. In 
addition to the abilities presumed 
necessary for registering congruent 
luminous patterns, the additional 
abilities for registering optical n and 
optical theta are: (a) to trace out 
parallel meridians in the optical tex- 
ture; (b) to count off equal numerical 
units of congruent luminous patterns 
along an optic meridian; (c) to observe
relative differences in angular sizes of
optical n; (d) to register optic loci
where angular sizes of optical n reach
a maximum; (e) to bisect an optic
angle; (f) to register ratios of angular
sizes of optical n, e.g., the ratio be-
tween two angular sizes is 3.5:1, or
3.7:1; (g) to possess a discriminatory 
mechanism that can take ratios of 
angular sizes in the form specified by 
Equation 5. 


DISCUSSION


Optical theta describes the slant of a 
surface that is passed through a col- 
lection of like elements. The sepa- 
rate conditions for seeing that the 
elements “belong together” as a plane 
surface are not considered here. 

In this paper the slant of a surface 
has been defined with respect to the 
line of regard (a line fixed by the sta- 
tion point and the fixation point). 
When the line of regard changes, the 
axial coordinates and the slant of the 
surface are correspondingly different.
By introducing constants for the up-
ward or downward sweep of the line
of regard, interpolations presumably
could be made from slant as defined
by the line of regard to slant as de-
fined by gravity.

Optical n, on which optical theta
depends, has been defined as a unit
composed of several congruent lumi-
nous patterns and their luminous
interspaces. Thus, a regularity among
randomly irregular like elements was 
identified optically. The assumption 
that like elements are metrically 
irregular in size, shape, and separa- 
tion may be more general than is 
sometimes required. Like elements 
(or their separations) sometimes may 
be metrically congruent along one or 
more dimensions but not along others. 
A surface might possess elements, for 
example, that are approximately the 
same size in a left-right direction, are 
randomly irregular in size in an up- 
down direction, and are distributed 
randomly over the surface. As a 
result, individual congruent luminous 
patterns lying along left-right merid- 
ians might satisfy Criteria I to IV. 
In that event, optical n would have a
numerical value of n = 1.0 and would
not include the luminous interspace.
Optical theta could then be calculated
by comparing the angular sizes of
three congruent luminous patterns 
separated by equal angles beta, and 
located at the great-circle arc. 

The light reflected from a surface 
includes different classes of congruent 
luminous patterns. Any one of these 
classes can specify the slant of the 
surface. The optical texture posses- 
ses, therefore, redundant informa- 
tion. Which class or classes of con- 
gruent luminous patterns is used by 
the perceptual process depends, pre-
sumably, on a range of factors (e.g.,
Flock, 1963). All things being equal,
perhaps the class of congruent lumi-
nous patterns which allows the smallest
numerical value to optical n is the
most salient for the perceptual proc-
ess. At any rate, if perception in-
volves a confirming and disconfirm-
ing scanning process, estimates of
optical theta would be tested for dif-
ferent values of n as well as for dif-
ferent classes of congruent luminous
patterns. Redundancy of informa- 
tion functions in this model, there- 
fore, as a basis for confirmation, dis- 
confirmation, and correction. Re- 
dundancy is not assumed to function 
in any additive or interactive way. 

The formulation for optical theta 
assumes that both the viewer's head 
and the distal elements are motion- 
less. The normal condition of organ- 
isms is to move; and the normal 
environment of an organism possesses 
movement. Nevertheless, a motion- 
less viewer in a motionless world has 
more than academic interest. Beyond 
a few hundred feet there is little 
effective binocular disparity, and the 
human eyes are universally focused 
and not converged. Unless the or- 
ganism is moving rapidly there is 
little effective motion parallax for 
distant objects. Except for rapidly 
moving “others,” therefore, the optic 
conditions for the viewer looking at 
distant objects are essentially those 
of a motionless viewer in a motionless 
world. The motionless situation per- 
tains, therefore, to distance percep- 
tion and has corresponding practical 
importance. 

If either the surface or the viewer is 
in motion the specification of optical 
theta is considerably simplified. It is 
to be noted that when a viewer moves 
with respect to some part of a surface, 
the optic conditions are exactly the 
same as when that part of the surface 
moves with respect to the viewer. 
The motion, whether it be of the 
surface or the viewer, therefore, can 
be referred to the surface. 


The simplification of optical theta
when there is motion can be described
briefly. Consider a rigid surface that
is in motion normal to the line of
regard. Every point in the surface 
must be displaced through a linear 
distance, d, in time, t. If the surface 
is flat, the corresponding optic points 
have different angular velocities. All 
of the angular velocities in the optical 
texture travel along parallel optic 
meridians, reach a maximum at a 
locus of points called the great-circle
arc, and bilaterally are at a minimum
where the optic meridians converge. 
Select some optic point on the great- 
circle arc as the optic fixation point. 
Optical theta can be calculated from 
Equation 5 by comparing the angular 
displacements, α, γ, ε, for time, t, of
three luminous spots which are sepa- 
rated by equal angles beta and are 
initially located on parallel meridians 
at the locus of the great-circle arc. 

When the surface is in motion, 
Postulates I and II, congruent lumi-
nous patterns, and optical n do not
enter the calculation of optical theta. 
Under these conditions the accuracy 
of estimation of optical theta pre- 
sumably depends only on an optical 
texture in which the luminous changes 
are liminal, are sufficiently numerous, 
and are distributed over an extended 
optic area. 

The theoretical model represented 
by optical theta yields numerous 
hypotheses that can be tested em- 
pirically. Some of those hypotheses 
have been and are being tested. 
Gibson and Flock’s (1962) explana- 
tion of the mountain illusion, for 
example, is based on a systematic 
exception in nature to Postulate 
II. Flock and Moscatelli (1964) 
have found that increasing s in 
Equation 4 while holding constant 
the field of view and the numerosity 
of like elements yields less accurate 
slant judgments. They found also
that once optical n has a minimal
value, adding other regularities to a
surface texture hardly improves slant
judgments. Redundancy in the form
of several classes of like elements
also has been shown not to improve
slant judgments (Flock, 1963). In
an extended set of experiments with 
a surface in motion Flock (1964b) 
has shown that substituting good
for bad textural shapes, changing
velocity, prior exposure, and concom-
itant perceptions of shape have
little or no effect on the accuracy of
slant judgments.⁷

The occurrence of slant perceptions 
is assumed to depend on optic 
variables. (For a more extended dis- 
cussion of the assumptions underlying 
this point of view vis-à-vis, for ex- 
ample, Helmholtzian or Gestaltist 
orientations, see Flock, 1964a.) One
may know that a collection of distal 
elements does not conform to a 
slanted surface, and still one may see a 
slanted surface, as in the case of the 
cloud bank or of the virtual image 
on the cinema screen. The percep- 
tual result depends on the saliency 
of the relevant optic variables. A 
perceiver may, of course, disbelieve 
his perceptions or ignore optic vari- 
ables. Those are other problems and 
lead to other considerations. 

At several places in this paper abilities are listed which are presumed to be necessary for the registration of the various optic variables. The level of difficulty implied by those abilities is, for the most part, that of making comparisons. How and when those abilities develop and how much experience a viewer must have with the optic variables before the abilities emerge are presumably empirical questions.

⁷ Many more of the hypotheses mentioned throughout this paper have been tested by the writer and the data are being prepared for publication.


An O’s decision in a psychoacoustic detection experiment is governed by 2 broad classes of determinants: (a) external determinants, such as the likelihood of a particular wave form being a signal in noise and (b) internal determinants, such as the momentary state of the O's nervous system, his response biases, or biases as to certain sequences of responses. An analysis of these determinants and especially their relation to a trial-by-trial prediction of the S’s response (a molecular psychophysics) was undertaken. The analysis suggests that the relative importance of the external and internal determinants can be assessed by measuring the consistency of Os’ responses to repeated presentations of the same auditory signals. An experimental technique is introduced to measure such consistency and the results of some experiments are summarized. The results indicate that the amount of external variability is about the same as the internal variability. The implication of this result for any attempt to predict trial-by-trial behavior is discussed.


The ultimate goal of psychoacoustics 
is to understand the relation between 
the acoustic stimulus and the observer’s 
reaction or response to this stimulus. 
Unfortunately, this relationship can be 
influenced by a variety of nonacoustic 
variables. Even in as simple a situa- 
tion as a detection experiment in which 
the subject is asked whether or not a 
certain signal is present, the observer’s 
response is determined by his criterion 
for the presence of the signal. This 
criterion, in turn, is determined by 
motivational and instructional variables 
that are, strictly speaking, nonstimulus 
variables.

Although detection theory has con-
tributed a means of specifying the
criterion, the increase in the amount 
of data needed to achieve this specifi- 
cation has almost precluded its adop- 
tion as a standard technique in sensory 
psychology. In an effort to bypass the 
criterion problem in its entirety, some 
experimenters advocate that a forced- 
choice method be used. In a two- 
alternative, forced-choice test, for 
example, the subject is presented with 
two temporal intervals and asked to 
state in which of the two intervals a 
signal was presented. Any asymmetry 
in the subject’s choice of the two inter- 
vals would then be explained, presum- 
ably on the basis of some asymmetry in 
the sensory processes or some response 
biases. The problem here is that, al- 
though response biases or response 
preferences have long been studied, 
there is still ambiguity about the mag- 
nitude of these effects in typical sensory 
experiments. The ambiguity exists 
largely because most of the evidence
for the operation of these response
mechanisms occurs in experimental 
situations where relevant sensory in- 
formation is held at minimal levels. 

The immediate goal of this paper is 
to assess the relative contribution of 
these nonstimulus variables in a typical 
psychoacoustic detection task, one in 
which the auditory stimuli, at least on 
some of the trials, are exercising con- 
siderable influence on the observer's 
behavior. If the extent of such non- 
stimulus variables could be measured, 
then one might try to devise experi- 
mental techniques aimed at minimizing 
the influence of such factors in psycho- 
physical experiments. Alternatively, it 
may become apparent that response 
biases are an integral part of all psy- 
chophysical phenomena and that a 
recognition of these biases and their 
operation is essential for meaningful 
interpretations of psychophysical re- 
sults. In the latter case this technique 
provides a first step in the considera- 
tion of such factors. Finally, as a more 
remote goal, is the hope that the pres- 
ent investigation will provide some 
information on the feasibility of devel- 
oping a more molecular approach to 
psychophysical phenomena. This last 
aim will become apparent later in the
paper.

The paper begins with an analysis of 
the nonstimulus variables. That, in 
turn, leads to the introduction of 
a particular experimental technique, 
which, it is then argued, provides a 
realistic estimate of the influences of 
these nonstimulus variables in a psy- 
chophysical experiment. Preliminary 
results obtained with this technique 
establish an upper bound on the influ- 
ence of stimulus variables in a two- 
alternative, forced-choice detection task. 
Unfortunately, the exact character of 
the nonstimulus variables is still am-
biguous. We cannot, for example, at
present distinguish between pure re-
sponse preferences and internal or
biological noise. A further extension 
of the technique is proposed that should 
clarify this ambiguity. 

Before we consider the nature of the 
nonstimulus variables, let us review 
briefly the typical subject matter of
psychophysical research.


MOLAR PSYCHOPHYSICS


Consider a typical psychophysical 
experiment and the procedures used 
in collecting and presenting data. One 
often obtains large numbers of re- 
sponses to various stimulus intensities 
until some critical value of stimulus in- 
tensity (for example, the threshold 
intensity) is determined. Once the 
estimate of this intensity is obtained, 
the experiment is continued by chang- 
ing some aspects of the situation, such 
as the frequency or duration of the 
signal, and repeating the basic pro- 
cedure to obtain another estimate of 
this critical intensity. Finally, a graph 
is plotted with this critical intensity of 
the signal as the dependent variable 
and the other physical parameter as the 
independent variable. Thus, for ex- 
ample, the results show how the in- 
tensity of the signal must be changed 
as a function of signal duration to pre- 
serve the just detectable character of 
the signal. Note that the relation is 
between two physical aspects of the 
stimulus; the parameter is the average 
of a set of responses. Clearly, the in- 
ferred results apply only to average 
responses and not to the responses of 
the individual subject on a trial-by-trial 
basis. Thus most of traditional psy- 
chophysics is what we shall call 
“molar” psychophysics, to emphasize 
the point that the basic subject matter 
is an average of many individual psy- 
chophysical judgments. 
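
The molar procedure described above can be sketched in a few lines. The response lists, intensity values, criterion, and function names below are hypothetical and chosen only for illustration; linear interpolation is one simple way of locating a critical intensity and is an assumption of this sketch, not a method stated in the paper. The point is that the estimate depends only on averages, so reordering the trials leaves it unchanged.

```python
def proportion_yes(responses):
    """Average of a set of binary responses (1 = signal reported)."""
    return sum(responses) / len(responses)


def threshold(data, criterion=0.5):
    """Intensity at which the averaged response crosses the criterion.

    data maps stimulus intensity to a list of 0/1 responses. Linear
    interpolation between neighbouring intensities is assumed here.
    Returns None if the criterion is never bracketed.
    """
    points = sorted((level, proportion_yes(r)) for level, r in data.items())
    for (x0, p0), (x1, p1) in zip(points, points[1:]):
        if p0 <= criterion <= p1 and p1 > p0:
            return x0 + (criterion - p0) * (x1 - x0) / (p1 - p0)
    return None


# Hypothetical responses at four signal intensities (arbitrary units).
data = {
    10: [0, 0, 0, 1],
    12: [0, 1, 0, 1],
    14: [1, 1, 0, 1],
    16: [1, 1, 1, 1],
}

print(threshold(data))
# Reversing the trial order within each intensity gives the same estimate.
print(threshold({level: r[::-1] for level, r in data.items()}))
```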

It is often convenient to forget this 
fact and, indeed, many of the models in 
sensory psychology might be taken to
be models that would or could be used
to predict individual responses. In
fact, no such models have ever been
used in this way,³ and almost all of
the basic data of psychophysics pertain 
to averages or collections of responses 
rather than to individual responses. 
The necessity for this molar, rather 
than molecular, analysis is obviously 
dictated by our tacit understanding that 
the individual responses may at times 
be influenced by variables other than 
the physical characteristics of the stim- 
ulus. Two broad categories of these 
nonstimulus variables can be identified. 
One category is internal noise, the 
presence of random fluctuations within 
sensory systems that are claimed to 
influence the observer’s judgments. 
The other broad category of nonstimu- 
lus variables encompasses a variety of 
supposed mechanisms, such as pure 
response mechanisms, time-order ef- 
fects, stimulus expectancies, and the 
like. The communality of these diverse 
mechanisms is that all of their effects 
are derived from an analysis of the 
sequence of past responses. Thus they 
agree theoretically that an analysis of 
the past sequence of responses can be 
used to predict the next response.
Molar psychophysics, by averaging 
over many responses, is used to remove 
or suppress these nonstimulus influ- 
ences, and hence to reveal the “pure” 
stimulus determinants of the response. 
Frequently several subjects are employed, and the results from the different subjects averaged to suppress effects particular to individual subjects; then the analysis is not only molar but, strictly speaking, fails to apply to any individual subject. Rather, such results apply to some collection of subjects or some average observer.

³ Sherwin, Kodman, Kovaly, Prothe, and Melrose (1956) is the singular exception to this generalization. Unfortunately, their attempt to predict trial-by-trial behavior was not very successful. The present paper was heavily influenced by their work and, in fact, might be regarded as an analysis of the reasons for their lack of success. For earlier work on the subject, see Aron and Irwin (1932), Preston (1936a), Preston (1936b),


MOLECULAR PSYCHOPHYSICS 


Given the preceding designation of 
molar psychophysics, the subject mat- 
ter of molecular psychophysics must be 
the individual response of the individ- 
ual subject. The virtues of such a level 
of analysis are many, but two deserve 
special comment. First, a trial-by-trial 
analysis of the observer’s response may 
provide a critical and efficient test of a
hypothesis that is impossible on the 
molar level—impossible either because 
there is no difference in prediction on 
that level or because the amount of 
data required to select one hypothesis 
over another would be excessive. One 
can think of examples in which the 
relative efficiency of the molecular level 
of analysis is greater by the number 
of responses used to determine the 
average response on the molar level. 
Second, and more pertinent to this 
paper, molecular analysis provides a 
rigorous and demanding test of ideas 
about the manner in which nonstimulus 
variables influence the judgment proc- 
ess. Understanding one of these vari- 
ables on the molecular level is equiva- 
lent to being able to predict its influence 
on individual responses. Understand- 
ing the operation of a nonstimulus 
variable on the molar level is equivalent 
to being able to reduce some asymmetry 
in the subject’s average judgments by 
elaborate counterbalancing procedures. 
If the initial asymmetry is not very 
great, the effectiveness of these elabo- 
rate experimental procedures is fre- 
quently difficult to determine. 

The ultimate success of molecular 
psychophysics depends, of course, on 
how large a portion of the individual 
response is determined by the stimulus,
or by easily monitored response proc-
esses. If the individual responses are
largely determined by internal noise, 
then a prediction of trial-by-trial re- 
sponses will never be more than mar- 
ginally successful. It is possible, for 
example, that Sherwin, Kodman, Ko- 
valy, Prothe, and Melrose (1956), who 
attempted to predict individual re- 
sponse, were largely unsuccessful be- 
cause there was too much internal 
noise. Alternately, their simulation 
of the sensory process might have been 
wrong, or perhaps their weakness was 
incorporating only stimulus variables 
in their prediction scheme. These are 
some of the alternatives that this paper 
tries to assess. Let us return to our 
two broad categories of nonstimulus 
variables and indicate the manner in 
which they relate to the prediction of 
individual responses. 


SEQUENCE OF PAST RESPONSES


There is little question that the sub- 
ject is influenced by the sequence of 
his past responses. What happened 
on the past response, or the conjunc- 
tion of what happened on the past 
response and whether or not that 
response was correct, certainly will 
have some effect upon the response 
given on the present trial. There has 
been ample experimental demonstra- 
tion of the statistical significance of 
such factors. The question at issue 
is the size of these effects; what is the 
relative importance of these variables 
in actual psychophysical tests? 
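
One simple way to gauge the size of such effects is to tabulate how often a response follows each possible preceding response. The sketch below assumes a two-alternative, forced-choice record coded as interval 1 or 2; the sequence is hypothetical, and the function name and the choice of a first-order (one trial back) tabulation are assumptions made for illustration, not the analysis used in the paper.

```python
from collections import Counter


def conditional_choice_rates(choices):
    """Proportion of trials on which interval 1 was chosen, tabulated
    separately by the choice made on the immediately preceding trial."""
    pairs = Counter(zip(choices, choices[1:]))
    rates = {}
    for previous in sorted(set(choices[:-1])):
        total = sum(n for (p, _), n in pairs.items() if p == previous)
        rates[previous] = pairs[(previous, 1)] / total
    return rates


# Hypothetical record of interval choices over ten trials.
choices = [1, 2, 1, 2, 1, 2, 1, 1, 2, 1]

overall = choices.count(1) / len(choices)
print("overall rate of choosing interval 1:", overall)
print("rate by previous choice:", conditional_choice_rates(choices))
```

If responses were independent of the preceding response, the conditional rates would be close to the overall rate; a large gap between them, as in this contrived alternating record, indicates a response-sequence dependency.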

If response mechanisms could be understood in detail, then the experimenter, simply by tabulating the past responses, could work out the tendency for various responses and, by combining this variable with the stimulus factors, could, in theory, predict the trial-by-trial behavior of the subject. Actually, of course, there are very few available models that even attempt to deal with both stimulus and response variables. Those theories that achieve the most definite expression of the response factors are usually vague in their characterization of the stimulus and vice versa.⁵

⁴ Senders and Sowards (1952) contains a good review of the earlier published works. McGill (1957) summarizes much of the most recently published data. Apparently the first person to directly investigate the independence of judgments within the method of constant stimuli was Fernberger. For earlier work on the subject, see Aron and Irwin (1932), Preston (1936a), and Preston (1936b).

There is, in addition to this lack of theoretical structure, a rather difficult experimental problem that has never been fully faced. The problem is that practically all of the data demonstrating the importance of response factors has been collected in situations in which there is little or no sensory stimulus. The experimental situation is often equivalent to asking the subject to guess the location of the stimulus in a two-alternative, forced-choice task with the stimulus intensity set to zero. In such a situation, the observer has little chance to utilize any sensory information, since it is held to a minimal level throughout the experiment. Consequently, it is not too surprising that the subject’s responses are statistically dependent.⁶ Unless the subject can generate completely random sequences, no other outcome would be expected. Furthermore, setting the signal to zero may produce a bias in and of itself. If there is no signal, then there is no means of reinforcing the subject for making subtle stimulus distinctions. There is, in fact, no way of insuring that the responses obtained from the subject are even partially determined by the same class of variables that operate when the subject is making stimulus distinctions. The subject may, as the test continues, begin to generate intricate sequences of responses simply to satisfy some whim or fancy of his own. No objective indication of this change in the observer’s behavior will occur. Indeed, from the experimenter’s point of view, the emergence of certain regularities in the experimental data may be very reinforcing, and it does not seem wholly impossible that this pleasant state of affairs is communicated to the observer in subtle ways. Even dismissing this possibility, turning the stimulus off in a psychophysical experiment seems to satisfy all of the formal requirements for the generation of superstitious behavior, as it has been studied in lower animals (Skinner, 1948). Superstitious behavior is, after all, nothing more than an elaborate stereotyped repetition of certain sequences of responses. Thus in a very real sense the data obtained in psychophysical experiments without a sometimes detectable signal may have very little predictive value in a typical psychophysical experiment.⁷

⁵ See Luce (1959). The stimulus is summarized by a multiplicative constant that, when greater than unity, increases one response tendency relative to another. Whether the constant refers to signal power, logarithm of signal power, or some other transformation of the signal wave form is left as a problem to be determined empirically. This vagueness about the stimulus in response theories is counterbalanced by sensory or stimulus theories that, since there is no absolute need to represent response biases within the theory, usually ignore them.

⁶ Responses obtained in ESP experiments have often been analyzed to determine response sequence characteristics. See Goodfellow (1938), Skinner (1942), and Solomon (1949). The paper by Speeth and Mathews (1961) illustrates the use of a very low intensity signal in studying response factors.

7The second conclusion of the work of 
Senders and Sowards (1952) is as fol- 
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On the other hand, if the subject is 
achieving nearly 100% correct re- 
sponses in a two-alternative, forced- 
choice task, then the stimulus influ-
ence is obviously so strong that the
response factors cannot manifest them-
selves in the data. Whether the mani-
fest and latent levels go hand in hand
is largely an unanswered question.
One might maintain, for example,
that as the stimulus factors become
more influential, the response factors
are driven to zero; or one can defend
the position that as the stimulus factors
become more influential, the presence
of response factors becomes harder to
detect in the data, but that on the latent
level they are as powerful as ever.
Each position has some testable conse-
quences, but little work has been
carried out on this problem.

As previously observed, all of these
determinants (simple response biases,
time-order effects, motivational changes
brought about by making a correct
response on the previous trial, and so
forth), while different on a theoretical
level, are measured and inferred from
the history of previous responses.
They are operationally similar and
thus, for the purposes of this paper, we
shall combine all of these effects under
the broad category of response-sequence
effects.8


lows: “no subjects should be required to
make threshold or near threshold discrimina-
tions for many times in succession. Under
these conditions only extremely patient, well-
trained subjects can continue to let their
responses be determined by the stimulus
alone; others will develop a pattern of
responses and each response will tend to be
response-determined rather than stimulus-
determined. If threshold measurements are
to be made, easier discriminations should be
interspersed [p. 373].”

8 This designation of response sequence
effects is very nearly what McGill (1957) 
called “guessing.” “ ‘Guessing’ then becomes 
a label for effects attributable to a broad 


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INTERNAL NOISE 


Finally, we come to a concept that 

has had a very long history in psycho- 
physics. Historically, internal noise, 
some random perturbation of the sen-
sory processes, was the main impetus
toward the development of psycho- 
physical methods. Internal noise is an 
inferred quantity, its presence deduced 
from its manifest effects. It cannot be 
observed directly even by physiologists, 
for until the sensory code is completely 
understood, one can hardly distinguish 
what part is noise and what part sig-
nal. Thus, presently, on both physio-
logical and psychological levels, noise
is synonymous with the presence of ef-
fects that are not fully understood.
On an operational level, internal
noise is equivalent to the observation 
that the same physical stimulus may 
elicit different responses. In a sense, 
then, internal noise is the limiting fac- 
tor in a trial-by-trial prediction of the 
subject’s response. 

It should be noted that response fac-
tors and internal noise are similar in
that both prevent a complete prediction 
of the subject’s response from stimu- 
lus factors alone. They differ largely 
in that presumably, a more complete 
understanding of the response-sequence 
mechanisms and a successful combina- 
tion of these response tendencies with 
the stimulus variables would allow one 
to predict the trial-by-trial behavior of 
the subject. Internal noise effectively 
prevents such a prediction, since there 
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class of determiners including set, previous 
training, method of responding, and so on. 
The essential feature of this class of con-
straints is that its effects in judgments are
the same no matter what stimulus has been
presented to the subject. In short, guessing
habits act independently of stimuli. It fol-
lows that guessing can be measured by meas-
uring the predictability of successive judg-
ments ... [p. 297].”
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is no way, by definition, to monitor the 
effects of this variable. 


AIM OF THE PRESENT RESEARCH


The aim of this paper is to develop 
an experimental technique that will al- 
low one to estimate the magnitude of 
the nonstimulus factors and to assess, 
at least crudely, the relative importance 
of stimulus and nonstimulus factors. 
The present experimental results do 
not completely achieve the aim. The 
experimental technique does, however, 
have the potential to apportion the 
relative variability caused by internal 
noise and response dependencies, and 
we shall return to this potential later in 
the paper. The basic rationale of the 
procedure can be quickly summarized. 
If the stimulus alone determined the 
subject's response, and the subject were 
presented with the same stimulus on 
two separate occasions, we would ex- 
pect the same response. Even if each 
individual judgment were subjected to 
some variability, we might expect the 
judgments to tend to be in agreement. 
Obviously, the exact amount of agree- 
ment, the degree of consistency, would 
depend on the relative influence of the 
stimulus factors and the amount of this 
internal variability. Let us now 
consider the procedure. 


EXPERIMENTAL PROCEDURE 


A two-alternative, forced-choice pro- 
cedure was used throughout the ex- 
periments. Two types of trials were 
presented to the observer. One type, a 
signal trial, had a signal added to the 
noise background. The signal oc- 
curred at random in one of the two 
time intervals of the trial. After the 
subject made his response, he was in- 
formed whether or not it was correct. 
In the other type of trial, a no-signal 
trial, no signal was added in either of 
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the two temporal intervals of the 
forced-choice trial. Thus the observer 
simply listened to two samples of 
noise. No information followed a no- 
signal trial, since no “right” or “wrong” 
response is possible. 

A sequence of 100 trials constituted 
a single experimental block, which 
lasted approximately 5 minutes and 
consisted of 50 signal and 50 no-signal 
trials.9 The purpose of mixing signal
and no-signal trials was (a) to provide 
the observer with a constant reminder 
of the type of signal used in the psycho- 
physical test and a means of reinforc- 
ing appropriate responses to that 
signal and (b) to monitor the level 
of the observer’s performance through- 
out the test sessions. 

The dependent variable of the ex- 
periment is based on a comparison of 
the subject’s response during the first 
and second presentations of the tape. 
Consider only the no-signal trials. If 
the subject selects the same interval as 
his response on both the first and sec- 
ond presentations of the audio tape 
recording, then this constitutes an 
agreement; if the responses differ, 
there is no agreement. The percent- 
age of agreements is calculated from 
the ratio of agreement trials to the total 
number of no-signal trials. This per- 
centage will be the main dependent 
variable of the experiments discussed 
here. 
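The agreement measure just defined can be sketched in a few lines. This is only an illustration, not the scoring procedure actually used in the experiments: the function name `percent_agreement`, the coding of the two intervals as 1 and 2, and the sample responses are all assumptions made for the example.

```python
def percent_agreement(first, second, is_no_signal):
    """Percentage of no-signal trials on which the same interval was chosen.

    first, second : responses (1 or 2) from the two presentations of one tape
    is_no_signal  : True where the trial contained noise alone in both intervals
    """
    # Keep only the no-signal trials; signal trials are ignored by this measure.
    pairs = [(a, b) for a, b, ns in zip(first, second, is_no_signal) if ns]
    if not pairs:
        raise ValueError("no no-signal trials to compare")
    agreements = sum(1 for a, b in pairs if a == b)
    return 100.0 * agreements / len(pairs)


# Hypothetical responses to six trials, four of which are no-signal trials.
first = [1, 2, 1, 1, 2, 2]
second = [1, 1, 1, 2, 2, 2]
no_signal = [True, True, False, True, True, False]
print(percent_agreement(first, second, no_signal))  # 2 of 4 agree: 50.0
```

With 50 no-signal trials per block, a single pair of presentations gives a score in steps of 2%.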

All audio information was recorded
on a single channel of the magnetic
tape; the second channel stored the
timing information that controls the
various visual indicators accompanying
the auditory information.
This dual channel arrangement ensures 
almost perfect synchrony between the 
occurrence of an audio event and 

9 Presentation order was determined by
random sampling without replacement from
a set of 100 items, half even and half odd.
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a corresponding observation-interval 
light. This correspondence is essen- 
tial since the signal duration was only 
1/10 second. 

An Ampex Model 320 tape recorder 
was used. The audio information on 
the tape had a signal-to-noise ratio in 
excess of 40 decibels. While it is im- 
possible to guarantee that the two pres- 
entations of the tape were exactly alike, 
every precaution was taken to approach 
that goal. The largest source of stimu- 
lus variability probably arose because 
of variation in the speed of the tape 
through the heads and slight variation 
in contact between the playback head 
and the magnetic tape. These varia- 
tions produce both amplitude and 
frequency modulation of the audio
signal.10


EXPERIMENTAL RESULTS 


Two main experiments were con- 
ducted to probe the manner in which 
differences in the details of the experi- 
mental procedure might influence our 
estimate of the observer’s consistency. 
The two experimental questions were: 
(a) How does the level of the signal 
presented on the signal trial influence 
the consistency measured on the no-
signal trial? (b) How does the char-
acter of the signal and the exact pres-
entation procedure employed on the no-
signal trials influence the observer's
consistency?


Experiment I 


Noise was used as the signal in this 
experiment, that is, the signal was an 


10 Certain aspects of the taped signal
(power or energy level in the signal interval)
were repeated almost exactly, since for cer-
tain recordings simple electronic detection
systems (broad-band energy detectors) were
able to achieve 100% consistency in judg-
ments based on two presentations of the
tapes.
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increment in power of the continuous 
background noise. Three levels of in- 
crement for the signal were selected so 
that the subjects would receive ap- 
proximately 60, 75, or 90% correct at 
the highest signal level. Four tapes, 
each containing 100 trials, were pre- 
pared at each signal level. The sub- 
ject listened to each tape six times. 
Percentage agreement scores were cal- 
culated by comparing the response 
given on successive presentations of 
the tape. The six presentations yielded 
three independent estimates of a per 
cent agreement score. 

A two-stage sampling process is in- 
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volved in this procedure. By re- 
peating the same recording several 
times and determining the percentage 
agreement for successive pairs of pres- 
entations, we obtain an estimate of the 
percentage agreement score for that 
particular tape sample and its particu- 
lar pairs of noise stimuli. By using 
several different tapes, we obtain dif- 
ferent estimates from the population 
of stimulus pairs and hence estimates 
of the consistency that is possible in 
this particular physical situation. 

In the present instance, our estimate 
of the percentage agreement for each 
tape is based on 150 trials (50 no- 


TABLE 1

PERCENTAGE AGREEMENT SCORES AND PERCENTAGE CORRECT DETECTION SCORES
(IN PARENTHESES) BASED ON 150 OBSERVATIONS

Weak signal
                  Observer 1   Observer 2   Observer 3   Mean over observers
Tape 1            69 (57)      57 (55)      69 (57)      65 (56)
     2            71 (68)      63 (62)      53 (61)      62 (64)
     3            66 (58)      66 (55)      67 (63)      66 (59)
     4            69 (61)      70 (60)      65 (57)      68 (59)
Mean over tapes   69 (61)      64 (58)      64 (60)      65 (60)

Medium signal
                  Observer 1   Observer 2   Observer 3   Mean over observers
Tape 1            61 (82)      64 (77)      54 (76)      60 (78)
     2            73 (6?)      60 (76)      58 (75)      64 (73)
     3            75 (68)      71 (77)      63 (68)      70 (71)
     4            68 (73)      65 (73)      66 (68)      66 (71)
Mean over tapes   69 (73)      65 (76)      60 (72)      65 (74)

Strong signal
                  Observer 1   Observer 2   Observer 3   Mean over observers
Tape 1            61 (93)      60 (91)      52 (88)      58 (91)
     2            69 (??)      62 (92)      62 (93)      64 (?3)
     ?            60 (90)      59 (80)      66 (91)      65 (90)
Mean over tapes   66 (92)      60 (91)      60 (91)      62 (91)

Note. A question mark marks a character that is illegible in the source. Only
three tape rows survive for the strong signal. The table footnote is largely
illegible; it appears to state that a slight amplitude distortion of the noise
was indicated for one tape, and its legible fragments include the scores
60 (82) and 56 (82), the latter for Observer 3.
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signal trials, and 3 pairs of observa- 
tions). Our consistency estimate for 
this physical situation is based on 200 
sample pairs of noise stimuli (50 no- 
signal trials per tape, and four tapes). 

The results of this experiment are 
given in Table 1 in which the average 
percentage of correct detections for all 
six presentations of the tape is pre- 
sented together with the average per- 
centage agreement score based on three 
pairs of presentations of the tape. 

The grand means for the three condi- 
tions are indicative of the main con- 
clusion, namely, there does not seem 
to be a strong relation between the 
percentage of correct detections ob- 
tained on signal trials and the consist- 
ency of the observer’s judgments as 
inferred from the no-signal trials. 
Each individual observer supports this 
conclusion. 

We shall postpone a more detailed 
discussion of how response dependen- 
cies might influence the percentage 
agreement score until the remaining 
experimental results have been dis- 
cussed. 


Experiment II 
Consistency with Sinusoidal Signals 


The main question explored in this 
series of experiments was: Does the 
consistency of the observer’s judgments 
depend heavily on the kind of signal 
used in the tests? In the results just 
discussed, the signal was an increment 
in the level of a noise. In this experi- 
ment the signal was a gated sinusoid 
of 1/10-second duration. Four signal
frequencies were investigated: 250 cps,
500 cps, 1,000 cps, and 2,000 cps. At
each frequency four tapes were pre- 
pared and the subject listened to each 
tape four times. Table 2 lists the re- 
sults obtained in these experiments. 


The average percentage agreement is 


approximately 70% at all frequencies 
except 1,000 cps for which the average 
is 78%. All subjects had participated 
in another experiment of several weeks 
duration in which the signal frequency 
was 1,000 cps, and this may account 
for the slightly higher percentage 
agreement score obtained at this fre- 
quency. Note, however, that this re- 
sult is almost wholly determined by the 
results of the third subject. 


Other Signal Conditions and 
Presentation Procedures 


In addition to the two stimulus con- 
ditions described thus far, a variety of 
other presentation procedures were in- 
vestigated. Most of these situations 
were not tested exhaustively, although 
an attempt was made to sample the 
variety of different procedures in order 
to determine if any one would produce 
outstandingly consistent behavior. 

The following conditions were in- 
vestigated : 

1. Tapes with silence between the 
observation intervals. The pulse audio 
wave form of 1/10-second duration 
was presented during the observation 
interval. The wave form was either 
noise alone or signal plus noise. 

2. A pedestal was employed. In 
both observation intervals of the 
forced-choice procedure, a stimulus of 
the same kind as the signal was added 
to the background noise. The signal 
was then added in phase to this con- 
stant increment in one of the two inter- 
vals on the signal trial. Thus the 
subject’s task was to select the more 
intense of the two pulsed sinusoids. 

3. The same signal was presented in
both intervals of the forced-choice pro-
cedure. Instead of 50 trials with no
signal in either interval of the forced-
choice procedure, a tape was prepared
with 50 trials in which the same signal
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TABLE 2 
PERCENTAGE AGREEMENT SCORES BASED ON 100 OBSERVATIONS

Signal 250 cps
                  Observer 1   Observer 2   Observer 3   Mean over observers
Tape 1            68           72           70           70
     2            80           72           68           73
     3            76           54           78           ??
     4            76           52           78           6?
Mean over tapes   75           62           7?           7?

Signal 500 cps
                  Observer 1   Observer 2   Observer 3   Mean over observers
Tape 1            84           67           45           65
     2            90           8?           51           74
     3            74           64           57           65
     4            70           78           70           73
Mean over tapes   80           73           56           ??

[Signal frequency heading illegible in source]
                  Observer 1   Observer 2   Observer 3   Mean over observers
Tape 1            76           77           85           ??
     2            86           73           79           ??
     3            83           75           77           78
     4            76           71           79           ??
Mean over tapes   80           74           80           ??

Note. A question mark marks a character that is illegible in the source. The
remainder of the table is illegible.


was presented in both temporal in-
tervals. Since the signal was present
in both intervals, it cannot provide the
cue for a choice between the intervals.

4. Complex signals, generated by
adding four sinusoids together, were
also tried as the signal in the percent-
age agreement analysis.

5. The background level of the noise
was raised and lowered over a 40-deci-
bel range to determine whether the
absolute level of the background noise
had an appreciable effect on the ob-
server's consistency. This was done
both with noise as a signal and for
some sinusoidal-signal conditions.
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None of these special presentation 
modes produced percentage agreement 
scores that appeared, on the basis of 
small samples, to be very different 
from those already reported in detail. 
In general, there was a small but con- 
sistent trend for tapes using sinusoidal 
signals to generate somewhat higher 
percentage agreement scores than tapes 
in which the signal was an increment 
in the power of noise. That the stimu- 
lus situation does not heavily influ- 
ence the percent agreement scores sug- 
gests that the percentage agreement 
score may be largely determined by 
factors other than stimulus. Let us 
consider this alternative in more detail. 


RESPONSE DEPENDENCIES 


Because dependencies among the ob- 
server's responses can affect the per- 
centage agreement score, we must dis- 
cuss this dependency in some detail 
and try to estimate the magnitude of 
its effect in these experiments. The 
logical starting point is the simplest 
response dependency that one can en-
counter in a two-alternative task,
namely, a pure response bias. 


Pure Response Bias 


If the observer always selected the
same interval on all trials, his percent-
age agreement score, as analyzed from
the no-signal trials, would be 100%.
To the extent that any observer in-
creases the relative frequency of one
response over another, the percentage
agreement score is bound to increase;
the amount of increase is easily deter-
mined. If the subject chooses the first
interval of some no-signal trial with
probability $p$, then he chooses the
second interval with probability
$q = 1 - p$. Assuming independence be-
tween the two presentations, the proba-
bility of agreement is simply the
probability of agreement on the first
interval, $p^2$, plus the probability of
agreement on the second interval, $q^2$.
That is, if only the response bias is
operating and the subject chooses in-
terval 1 on 60% of the trials, then his
mean percentage agreement score is
52% [$(.6)^2 + (.4)^2 = .52$].

As this example indicates, it takes a
sizable bias to generate any appreciable
effect on the percentage agreement
score. In fact, if the bias, $b$, is taken
to be the excess of the most popular
interval over $1/2$, then the increase in
the probability of agreement is $2b^2$.
Thus for the percentage agreement 
score to exceed 60% on the basis of a 
response bias alone the selection of the 
most popular interval must exceed 
72%. The largest pure response bias 
for any condition of the experiment 
was a selection of the first interval
on 62% of the no-signal trials. It is
therefore not surprising that no sig- 
nificant percentage agreement score 
was found in any of the data collected. 
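The arithmetic of the pure-bias argument can be checked with a short sketch. It assumes, as the text does, that the two presentations are independent and that only a fixed preference for one interval is operating; the function names are introduced here for illustration only.

```python
import math


def agreement_from_bias(p):
    """Probability of agreement when interval 1 is chosen with probability p
    on each presentation, independently (assumption taken from the text)."""
    q = 1.0 - p
    return p * p + q * q


def bias_needed(target):
    """Smallest preference for the favored interval that yields a given
    agreement probability, from P(A) = 1/2 + 2 b^2 with b = p - 1/2."""
    if not 0.5 <= target <= 1.0:
        raise ValueError("target agreement must lie between 0.5 and 1")
    b = math.sqrt((target - 0.5) / 2.0)
    return 0.5 + b


print(round(agreement_from_bias(0.60), 4))  # the 60% example in the text
print(round(agreement_from_bias(0.62), 4))  # the largest bias observed
print(round(bias_needed(0.60), 4))          # preference needed for 60% agreement
```

Under these assumptions the largest observed bias, 62%, raises the expected agreement by less than three percentage points above chance.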

There is a variety of other simple 
response biases that one might con- 
sider, for example, alternation and 
double alternation, but such response 
chains can produce large percentage 
agreement scores only if the sequence 
starts on the same trial during the 
first and second presentation of the 
tape. In short, the start of the re- 
sponse chain must be synchronized to 
begin at the same place during both
presentations. Thus pure response de-
pendencies may have some influence
on percentage agreement scores but 
their effects are certainly small. Let 
us consider a more likely candidate. 


11 It should be remembered that response
bias was carefully monitored in the course of
the experiments and the observers cautioned
about over-frequent responses to one or the
other interval.
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Stimulus-Response Dependencies 


The occasion of the signal on the 
signal trials, or the subject’s correct or 
incorrect response to that signal, does 
provide the synchrony needed to trig- 
ger response sequences that could gen- 
erate agreements among the subject’s 
responses on the successive presenta- 
tions of the tapes. A careful analysis 
of the response following signal trials 
was, therefore, undertaken. Only one 
consistent pattern emerged. There was 
a tendency among all observers to 
choose the interval opposite the one 
on which they had just been correct. 
The effect was not large (60% at 
most) but was fairly consistent. 

Such an effect, by itself, will not 
seriously affect percentage agreement, 
partly because, like a pure-response 
bias, the extent to which the bias affects 
the percentage agreement score is pro-
portional to $b^2$ rather than $b$.

Furthermore, the bias observed, 
namely, to choose the interval oppo- 
site the interval that had just been 
correctly chosen, should show a varia- 
tion with the percentage of correct 
detections on the signal trials. The re- 
sults of Experiment I did not indicate 
such a trend. 

There are many more analyses that 
might be undertaken in an effort to ab- 
stract response dependencies from the 
data.12 One then might attempt to
calculate their probable effect on the 
data obtained. A much more positive 
approach to this question would be to 
conduct the following experiment. 


12 Published data on this problem do
not give very clear advice on what bias one
might expect. At last count, there were five
studies reporting a tendency to alternate
response alternatives, whereas three other
studies reported that the subject tended to
repeat the same alternative more than one
would expect on the assumption of inde-
pendence.




Crucial Experiment 


The basic question is: How much of 
the percentage agreement score is the 
result of response dependencies, and 
how much is the result of stimulus fac- 
tors? Any percentage agreement score 
reported above is an upper bound on 
the operation of stimulus factors be- 
cause each score could be the result of 
response dependencies alone. The ob- 
vious way to investigate this problem 
is to generate a tape in which exactly 
the same stimulus occurs on both inter- 
vals of the no-signal trials. That is, 
the 50 no-signal trials might consist 
of 50 different samples of noise, but 
each noise sample would occur in 
pairs, the same wave form occurring on 
the first and second intervals of the 
forced-choice trial. Thus on a no-sig- 
nal trial there would be not only no 
signal but no stimulus difference. Any 
percentage agreement score greater 
than chance would indicate the opera- 
tion of response tendencies alone. It 
is the author’s guess that the percent- 
age agreement scores will be near 
chance under these conditions, but this 
is only conjecture, since admittedly any 
of the percentage agreement scores re- 
ported above could be the result of 
some intricate and undiscovered re- 
sponse chain. 

Constructing such a tape would be 
most easily accomplished if a computer 
were available to generate the stimuli. 
One could also easily produce tapes in 
which some of the no-signal trials con- 
tain two different random samples of 
noise, whereas other trials contain 
identical noise samples. Then the rela- 
tive consistency of the observer's be- 
havior resulting from stimulus factors 
could be directly compared with those 
resulting from response factors. This 
experiment would also serve to clarify 
the amount of internal or biological 
noise present in these experiments. 
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ESTIMATES OF INTERNAL NOISE 


If we assume that all of our agree- 
ment scores are the result of stimulus 
factors, then we can estimate from the 
lack of agreement the magnitude of 
internal noise. Our estimates of in- 
ternal noise are minimal estimates. 
Should the response dependency ac- 
count for a large part of the agreement 
score, our estimate of the influence of 
stimulus factors would have to be re- 
duced and our estimate of the amount 
of internal noise increased. 

Estimates of internal noise can be 
obtained on the basis of the following 
model. The process of the observer 
listening to a no-signal trial can be 
likened to drawing two samples from a 
Gaussian population. Let the samples
representing the two intervals be de-
noted $x_1$ and $x_2$. We assume that
both have zero mean and variance
$\sigma_E^2$. This variance represents the
variability of the external noise. The
simplest assumption is that the internal
noise sample, which we denote $y_i$, is
added to the external noise.

The following simple decision rule is
assumed:

If $x_1 + y_1 > x_2 + y_2$,
then choose interval one;
If $x_1 + y_1 < x_2 + y_2$,
then choose interval two.

The first sample of internal noise, $y_1$,
should be independent of the second
sample, $y_2$, and we denote the variance
of $y_i$ as $\sigma_I^2$.

When the tape is presented the sec-
ond time, $x_1$ and $x_2$ again occur as the
samples from the taped external noise.
These samples are now combined with
new samples of internal noise; let us
denote the new samples $y_1'$ and $y_2'$.
Once again the same decision rule is
assumed, except $y'$ replaces $y$.
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TABLE 3 


PROBABILITY OF AGREEMENT P(A) VERSUS
RATIO OF STANDARD DEVIATION OF
EXTERNAL AND INTERNAL NOISE

P(A)     $\sigma_E/\sigma_I$      P(A)     $\sigma_E/\sigma_I$
0.503      0.10       0.803      2.10
0.512      0.20       0.811      2.20
0.526      0.30       0.818      2.30
0.544      0.40       0.824      2.40
0.564      0.50       0.830      2.50
0.585      0.60       0.836      2.60
0.607      0.70       0.842      2.70
0.628      0.80       0.847      2.80
0.648      0.90       0.851      2.90
0.667      1.00       0.856      3.00
0.684      1.10       0.875      3.50
0.701      1.20       0.890      4.00
0.716      1.30       0.901      4.50
0.730      1.40       0.911      5.00
0.743      1.50       0.925      6.00
0.755      1.60       0.936      7.00
0.766      1.70       0.944      8.00
0.777      1.80       0.950      9.00
0.786      1.90       0.955     10.00
0.795      2.00       0.982     25.00
                      0.995    100.00


Agreement with the previous de- 
cision will be obtained if the inequality 
is in the same direction on both presen- 
tations. 
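The model lends itself to a direct simulation. The following is a minimal Monte Carlo sketch, not the computation reported in the paper: the function name, the trial count, and the seed are choices made for the example, and the internal noise is given unit standard deviation so that the external standard deviation equals the ratio of external to internal noise.

```python
import random


def simulate_agreement(ratio, n_trials=100_000, seed=0):
    """Estimate the probability of agreement for a given ratio of external
    to internal noise standard deviation (illustrative assumptions only)."""
    rng = random.Random(seed)
    sigma_e, sigma_i = float(ratio), 1.0
    agree = 0
    for _ in range(n_trials):
        # The taped (external) noise samples are identical on both presentations.
        x1 = rng.gauss(0.0, sigma_e)
        x2 = rng.gauss(0.0, sigma_e)
        # Fresh internal noise is drawn for each interval on each presentation.
        first = (x1 + rng.gauss(0.0, sigma_i)) > (x2 + rng.gauss(0.0, sigma_i))
        second = (x1 + rng.gauss(0.0, sigma_i)) > (x2 + rng.gauss(0.0, sigma_i))
        agree += first == second
    return agree / n_trials


for r in (0.5, 1.0, 2.0):
    print(r, round(simulate_agreement(r), 3))
```

With equal external and internal variability the simulated agreement should fall close to two thirds, and it should approach one half as the external noise becomes small relative to the internal noise.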

Table 3 shows the probability of 
agreement for several ratios of exter-
nal-to-internal noise.13

Since the average percentage agree-
ment is nearly 66%, this model would
suggest as a first approximation that
the internal variability is nearly equal
to the external variability.
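For readers who wish to go from an observed agreement score to a noise ratio without interpolating in Table 3, the sketch below may help. It rests on an assumption that is not stated in the paper: under the Gaussian model the two decision variables are jointly normal with correlation r^2/(1 + r^2), where r is the ratio of external to internal standard deviation, so the standard arcsine identity for the probability that two such variables share a sign applies. The function names are hypothetical.

```python
import math


def p_agree(ratio):
    """Probability of agreement for a ratio r of external to internal noise
    standard deviation, using P(A) = 1/2 + arcsin(rho)/pi with
    rho = r^2 / (1 + r^2). The closed form is an assumption of this sketch."""
    rho = ratio ** 2 / (1.0 + ratio ** 2)
    return 0.5 + math.asin(rho) / math.pi


def ratio_from_agreement(p):
    """Invert p_agree: the noise ratio implied by an agreement probability."""
    if not 0.5 <= p < 1.0:
        raise ValueError("agreement probability must satisfy 0.5 <= p < 1")
    rho = math.sin(math.pi * (p - 0.5))
    return math.sqrt(rho / (1.0 - rho))


for r in (0.5, 1.0, 2.0, 3.0):
    print(r, round(p_agree(r), 3))
print(round(ratio_from_agreement(0.66), 2))
```

On the rows spot-checked (ratios of 0.5, 1, 2, and 3) this expression matches the tabled values to three decimals, and an agreement score of 66% corresponds to a ratio slightly below one.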

Before we comment further on this
estimate of the amount of internal


13 The calculations shown in Table 3 were
made at the Computation Center, Massachu-
setts Institute of Technology, on an IBM
7090 computer. The author wishes to
acknowledge assistance from the Computa- 
tion Center, Massachusetts Institute of Tech- 
nology, and from Peter Reich who pro- 
gramed the computation. 
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noise, we should recognize the exist- 
ence of a completely different set of 
assumptions which might also be used 
to account for the fact that the observer 
is not completely consistent. Rather 
than assume that some sensory or de- 
cision device is noisy or unstable, we 
might assume, instead, that the system 
has finite resolution. That is, we 
might assume that the decision device 
cannot distinguish between two sensory 
magnitudes unless they differ by some 
minimal amount. Such a quantal as- 
sumption would explain the consistency 
data reported in this paper. Whether 
this assumption is tenable in general 
would depend on the exact assumptions 
that one made concerning the nature of 
the processes. Certainly internal noise 
and quantal effects are often difficult to 
disentangle. The separation of these 
two hypotheses, which are very differ- 
ent in spirit at least, remains a chal- 
lenging and important problem. 

We return to our estimates of inter- 
nal noise based on the linear model. 
What are we to make of this estimate? 
There are certainly a good many as- 
sumptions in the model used to gen- 
erate the calculations. The estimate is 
also only a lower bound. By itself this 
estimate of the amount of internal noise
would be highly suspect. The surpris-
ing, and somewhat startling, fact is that
this paper represents the third method 
used to estimate this ratio and that all 
three methods yield estimates of the 
ratio very near one. 

Swets, Shipley, McKey, and Green 
(1959), in determining the improve- 
ment from repeated observations of 
the same stimulus, estimate that the in- 
ternal variability of the subject must 
be approximately the same as the ex-
ternal variability. Their estimates of
$\sigma_E/\sigma_I$ range between 1.0 and 1.4.

Watson measures a ratio similar to
$\sigma_E/\sigma_I$ by comparing what he calls an
individual versus a group receiver op-
erating characteristic curve. He esti-
mates that the value of this ratio is nearly
one.

Whether this ratio is actually meas- 
uring the relative amount of internal 
and external variability is still uncer- 
tain. The consensus clearly suggests 
an interesting invariance in some aspect 
of the psychoacoustic situation. 


CONCLUSIONS 


The results show that the estimate 
of the ratio of internal-to-external 
variability is approximately one, and 
that this ratio is essentially constant 
over a wide variety of experimental 
conditions. This estimate is in close
agreement with similar estimates made 
by other investigators who studied both 
amplitude discrimination and the de- 
tection of a 1,000-cps sine wave in noise. 
This paper extends the range of ex- 
perimental conditions by studying sinu- 
soids of various frequencies, as well 
as signals that are samples of noise. 

Another way of looking at this in-
variance is to note that the observer
repeats his previous choices on approx-
imately 70% of the two-choice trials.
This result has important implications 
for the development of a molecular 
psychophysics, a psychophysics that 
attempts to predict the trial-by-trial 
behavior of the observer. This 70% 
consistency, since it appears to be inde- 
pendent of the sensory task, represents 
an upper bound on any attempt to pre- 
dict trial-by-trial behavior. It is, of 
course, possible that some still undis- 
covered variable may permit us to raise 
this ceiling, but at present it remains 
the highest that can be achieved by a 
realistic simulator of an individual ob- 
server, namely that same observer on 
another occasion. 
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Certainly this comparatively low 
consistency score will dampen the en- 
thusiasm of those who might wish to 
construct detection devices that will 
perfectly mirror the subject’s trial-by- 
trial behavior. Indeed, not only is the 
consistency low, but the source of this 
rather meager degree of consistency is 
somewhat ambiguous, for it remains to 
be seen whether this 70% score is 
largely caused by stimulus or response 
processes. A simple extension of the
present technique is proposed that 
should clarify this issue. However, 
even this rather low degree of consist- 
ency does not entirely preclude a 
molecular approach. 

Throughout this paper, only an ex- 
treme form of molecular psychophysics 
has been considered, a psychophysics 
in which the level of analysis was the 
individual response. One might, how- 
ever, present the same pair of wave 
forms 20 or 30 times, and measure the 
“most common” responses to the pair. 
The subject matter of such studies 
would be the relation between the 
acoustic wave form and this “most 
common” response. 

The consistency of this “most com- 
mon” response could be measured by 
a simple extension of this technique 
which was used to estimate the consist- 
ency of individual responses. Such ex- 
tensions would provide interesting in- 
formation about the manner in which 
the number of individual judgments 
used to determine the “most common” 
response affects the estimated variabil-
ity of the internal factors.

Admittedly, the use of this most- 
common-response technique can never 
be as efficient as a trial-by-trial analy- 
sis, but comparing the prediction of 
two theories on this quasi-molecular 
basis may be considerably more effi- 

cient than trying to distinguish between 


molar predictions of threshold inten- 
sities that are only one decibel apart. 

The development of some form of 
molecular psychophysics seems as in- 
evitable as the development of more 
quantitative theories of sensory func- 
tions. Indeed, more and more crucial 
tests of such theories will be possible on 
the molecular level as they become 
more exact and quantitative. Detailed 
analyses of the determinants of the 
individual responses are necessary if 
we are to make reasonable and sensible 
use of such molecular data. 

A THEORY OF CLASSICAL CONDITIONING

In view of the logical reasons and empirical evidence, the distinction
between classical and instrumental conditioning is made in terms of
experimental control. The stress on the nature of experimental con-
trol naturally leads us to identify 2 major components of conditioning.
The function of the first component depends on the transfer of energy
release (conditioning of autonomic responses) responsible for the
initiation of a skeletal CR. Without operation of the second, the in-
strumental component defined by the effect of CR on UCS, the acqui-
sition of CR is transient.


Classical and instrumental condition- 
ing experiments refer, respectively, to 
training procedures in which the sub- 
ject’s response does not or does change 
the UCS. The reference experiment 
for classical conditioning is the study 
of conditioned salivation in dogs by 
Pavlov. Other experimental proce- 
dures which have also come to be called 
conditioning derived from the work of 
Bekhterev and Thorndike. The latter 
methods differ from the classical Pav- 
lovian procedures in that the subject’s 
behavior is instrumental to the produc- 
tion of reward or avoidance of punish- 
ment. Hence the name instrumental 
conditioning. 

However, as was pointed out by 
Kimble (1961), it is difficult to pro- 
duce pure cases of classical or instru- 
mental conditioning. On the one hand, 
it is impossible to perform an instru- 
mental-conditioning experiment with- 
out also providing the conditions for a 
classical one. Since a reward or pun- 
ishment is an essential part of an 
instrumental-conditioning experiment, 
the situational cues regularly associated 
with reward or punishment function as 
conditioned stimuli. Similarly, unless 
the UCS precedes the CS in a classical 
conditioning experiment, the CR usu- 
ally has some effect on the UCS. For 
instance, in eyelid conditioning, the oc-
currence of an anticipatory CR partially
avoids the following airpuff and dimin-
ishes its intensity, introducing an in-
strumental component into the learning
which takes place in a typical classical
procedure. The manifestation of the
instrumental component appears in the
latency data of CR employing various
interstimulus intervals. In Spooner
and Kellogg's study (1947) of finger-
withdrawal conditioning, the mean
latencies of the CR for groups condi-
tioned at CS-UCS intervals of .5, 1.0,
and 1.5 seconds were, respectively,


experiment as classical procedure was
used and the UCS (shock) always
followed the CS, it may be that the
anticipatory occurrence of the CR ac-
quired the secondary rewarding prop-
erty through the subject’s past life-
learning. The last statement suggests
a possible way of understanding how
the CR may actually be modified in-
strumentally in classical conditioning.
In the history of the organism, the ter-
mination of painful stimuli has prob-
ably consistently been preceded by re-
sponses resembling the CR and,
in the manner of instrumental condi-
tioning, rewarded. This leaves in the
subject a tendency to respond in ways
which bring the reaction close to the
occurrence (in previous life, close to
the termination) of the UCS. Thus,
even in GSR conditioning and heart- 
rate conditioning where one would be 
very hard put to show that the CR has 
its effect on the UCS, the response is 
part of a reaction which normally fa- 
cilitates the mobilization of energy in 
an emergency situation. The result 
usually is to put an end to the noxious 
stimulation. The situation is no dif- 
ferent from that in other learning situ- 
ations involving a noxious UCS. 


NATURE OF EXPERIMENTAL CONTROL 
IN CLASSICAL AND INSTRUMENTAL 
CONDITIONING 


In the foregoing, an attempt was 
made to demonstrate the fact that in a 
typical Pavlovian conditioning experi- 
ment, an instrumental component of 
CR is always operating. Often it
affects the value of the UCS. Where it
does not, previous learning in which
such an effect did occur always seems
possible to demonstrate. A considera-
tion of the problem thus raised suggests
a new way of looking at the nature of
conditioning, the general nature of
which we turn to now.

In the study already cited, Spooner 
and Kellogg (1947) also investigated 
backward and simultaneous condition-
ing. According to these results, the
progress of conditioning shows an ini-
tial rise followed by a decrease to near
zero toward the end of training. In
all the instances there is one feature in
common. The CR has no effect on the
UCS. The effect here refers to both
the primary and the secondary ones.
This fact, together with the nature of
the results, suggests that the mere con-
tiguity of CS and UCS is sufficient to
initiate the CR, but that some other
process is necessary to make the CR
persist once its establishment by con-
tiguity has begun. This latter function
is apparently effected by the instru-
mental aspect of the CR. Making
this distinction we will refer to the
process which depends upon the con- 
tiguity of CS and UCS as the initiation 
component and the other process as 
the instrumental component. 

In what is usually called classical 
conditioning, the initiation component 
is under the direct control of the ex- 
perimenter. However, the second, in- 
strumental component, depends upon 
the subject’s behavior, since the avoid- 
ance or weakening of the UCS depends 
on the amplitude and the other charac- 
teristics of the CR which are not under 
the direct control of the experimenter. 

In what is usually called instru- 
mental conditioning the instrumental 
component is directly under the control 
of the experimenter. The amount of 
reward or punishment is controlled by 
the experimenter and delivered for a 
range of responses which differ mark- 
edly in their details. The initiation 
component, however, is not directly 
specifiable in a usual instrumental-con- 
ditioning experiment, because the time 
interval between the presentation of 
the discriminandum (CS) and the oc- 
currence of reward or punishment 
(UCS) is a variable depending on the 
details of the subject’s behavior. 

From the preceding analysis, it fol- 
lows that the usual treatments of classi- 
cal and instrumental conditioning em- 
phasize only one of the two essential 
components of the conditioning process. 


INITIATION AND INSTRUMENTAL 
COMPONENTS OF CONDITIONING 


The two-component theory of con- 
ditioning developed above holds that 
the initiation component is responsible 
for the first occurrence of CR and that 
the instrumental component accounts 
for the later firm establishment or 
change in the CR. The implications 
of this analysis are most significant 
for classical conditioning. To develop 
some of these implications, we start by
analyzing the concept of UCS. 

In the first place the UCS is a 
stimulus and therefore capable of 
arousing a certain sensory process 
within the organism, and the more 
important property of the UCS is that 
it regularly evokes a response, the 
UCR. The regularity with which the 
UCS evokes a response of certain 
strength, in one way of thinking, re- 
flects a drive function of the UCS. 
The drive function as used here refers 
only to that function which relates 
stimulus to response. Paired presenta- 
tions of the CS and the UCS (condi- 
tioning) thus may be viewed as a 
means of producing a transfer of drive 
function to the CS. 

The transfer of drive function pre- 
sumably depends on the transfer of 
energy mobilization. In other words, 
the conditioning of autonomic re- 
sponses underlies and is indispensable 
for the conditioning of a skeletal re- 
sponse. By means of the conditioning 
of autonomic responses energy is made 
available for the initiation of the CR 
to an originally neutral stimulus, CS. 
It is well known to a student of learn- 
ing that success in conditioning a reflex 
has been notably better in experiments 
employing electric shock, though the 
reflex may be readily elicited by some 
other stimulus which does not evoke 
autonomic responses. 

The concept of reward such as meas- 
ured by the weight of food is probably 
not a significant variable which psy- 
chologists are looking for. Neither is 
it in accordance with the concept of 
drive reduction nor with the idea of 
fear conditioning. They lack power to 
encompass a variety of behavioral 
phenomena. The transfer of energy 
availability is assumed to be basic 
to the initiation of a CR to a neutral 
stimulus. A brief excitation within the
subject provides the basis for the
transfer of energy availability in a posi-
tive reward training as well as in a
situation involving a noxious stimula- 
tion. The initiation component thus 
directly depends on the transfer of en- 
ergy availability. 

Therefore, a close relationship be- 
tween the CS-UCS interval function 
of a skeletal response and that of heart 
beat should be expected. As a matter 
of fact this has already been pointed 
out by Church and Black (1958). 
Namely, the latency of the first avoid- 
ance response, as a function of the CS- 
UCS interval, coincides fairly well with 
the mean latency of maximum heart
beat.

When the magnitude of the UCR is 
directly related to the amount of energy 
mobilization, the correlation between 
the magnitude of the UCR and that 
of the CR should, according to the 
present analysis, be nearly perfect. 
Such is the case as obtained in the 
study of heart-rate conditioning by 
Black, Carlson, and Solomon (1962). 

How the relationship between the 
conditioning of autonomic responses 
(especially those which are connected 
with energy release) and that of skele- 
tal responses is established through 
the ontogenetic development is a chal- 
lenging problem. In the course of 
the ontogenetic development, whenever 
there is a modification of behavior, 
there is conditioning of those autonomic 
responses responsible for energy re-
lease. This means that the condition-
ing of those autonomic responses is
much more frequent than that of spe-
cific skeletal responses. Accordingly,
the conditioning of those autonomic re-
sponses should be very rapid, because
we know that repetition of conditioning
accelerates the rate of conditioning.
Empirically, the acquisition and extinc-
tion in heart-rate conditioning occur
within 5 trials. This contrasts with
the rate of conditioning of eyeblink or
finger withdrawal, where more than 80
trials are usually required for acquisi- 
tion to an asymptotic level. 

Another implication is that those 
autonomic responses which mediate en- 
ergy release can be conditioned with 
backward procedure. Since the ex- 
perimental evidence indicates that the 
conditioned skeletal response estab- 
lished by backward conditioning pro- 
cedure is transient, those autonomic 
responses, in order to supply necessary 
energy for the initiation of the skeletal 
CR, should be conditioned beforehand. 
There is no empirical study on heart-
rate conditioning using backward pro-
cedure. However, evidence is avail-
able for the backward conditioning of
GSR (Jones, 1962).

We turn now to the nature of the 
instrumental component. As stated 
in the previous section, without the 
operation of the instrumental compo- 
nent, the initiated response is transient. 
In this statement, some inhibitory func-
tion of the UCR on the CR, when the
CR does not affect the UCR, is as-
sumed. At this point, a question may
be raised as to the fact that heart-rate
conditioning can be obtained in totally
curarized dogs (Black, Carlson, & Sol-
omon, 1962) even though the CR
(heart rate) cannot very easily alter the
effectiveness of the UCS. The answer
is in terms of frequency theory. The
secondary effect of heart-rate condi-
tioning in terminating noxious stimuli
by means of specific skeletal responses
has occurred so frequently in the past
history of the organism that the CR
(increase in heart rate) has become
more or less persistent, even if it does not
affect the UCS. This also explains the
wide range of the CS-UCS interval
function in heart rate conditioning.
Consider the reverse to be true, then
the conditioning of a specific skeletal

is scarcely possible, because
terminate


tioning of autonomic responses in curarized
Psychol. Monogr., 1962, 76(29,


in relation to CS-UCS intervals in classi-
cal aversive Psychol. Rev.,
1962, 69, 176-186.

In the article by Warner R. Wilson and Howard Miller in the
May 1964 issue, line 3 of column 2 on page 238 should read,

compatible with r-s since in r-s one

and on page 241, column 2, line 9 of the first paragraph should
read,

a-s-design (a-s-d) and to the some-dif-

THEORETICAL NOTE


VERBAL TRANSFER AND INTERLIST SIMILARITIES


JOHN P. HOUSTON


University of California, Berkeley


Degree and direction of verbal transfer were related to first list
stimulus-second list response similarity (S1-R2) and to second list
stimulus-first list response similarity (S2-R1). This was accomplished
through a relabeling of Osgood’s transfer and retroaction surface such
that the dimensions referred, not to degree of S1-S2 and R1-R2 similar-
ity, but to degree of S1-R2 and S2-R1 similarity. When S2-R1 similarity
is minimal transfer will be minimal, no matter what the degree of S1-R2
similarity. As S2-R1 similarity increases transfer increases until, when
S2 and R1 are identical, maximum negative transfer will obtain when
S1-R2 similarity is minimal and maximum positive transfer will occur
when S1 and R2 are identical.


Although the importance of the similar- 
ity variable in relation to transfer phe- 
nomena has long been recognized there 
has been a tendency among psychologists 
to focus upon particular loci of similarity 
and to ignore certain others. The sets of 
similarity relationships which have re- 
ceived the most attention in the typical 
two list S1-R1, S2-R2 verbal transfer sit-
uation are those expressed by Osgood’s
(1949) transfer and retroaction surface.
His three dimensional surface represents
an attempt to relate direction and degree
of transfer to first list stimulus-second
list stimulus similarity (S1-S2) and to
first list response-second list response
similarity (R1-R2).

Two loci of similarity which have been 
virtually ignored in the study of transfer 
may be described as the similarity be- 
tween first list stimuli and second list 
responses (S1-R2) and the similarity
between second list stimuli and first list
responses (S2-R1). What will be the
influence, if any, of varying degrees of
S1-R2 and S2-R1 similarity upon direc-
tion and degree of transfer? Although
the verbal learning literature contains a

provide a general integrative statement
of the relationships between transfer and
the S1-R2 and S2-R1 similarity loci. The
intent of the present paper is to provide
this kind of a statement, based upon d
through a relabeling of the dimensions of
Osgood’s transfer surface such that the
surface corresponds, not to the amount
of transfer associated with degrees of
S1-S2 and R1-R2 similarity, but to the
amount and direction of transfer asso-
ciated with varying degrees of S1-R2 and
S2-R1 similarity. Figure 1 contains the
relabeled surface.

As is true of Osgood’s surface the
vertical dimension represents direction
and degree of transfer. The length of
the horizontal plane, labeled S1-R2, rep-
resents degree of similarity between first
list stimuli and second list responses.
The width of the horizontal plane, labeled
S2-R1, represents degree of similarity be-
tween second list stimuli and first list
responses. In both cases the similarity
dimension extends from maximum simi-
larity or identity (labeled 1) to minimum
similarity or neutrality (labeled 0). If
the first list responses are different from
and neutral with respect to the second
list stimuli then, no matter what the de-
gree of second list response-first list stim-
ulus similarity, transfer will be minimal.
As the degree of S2-R1 similarity in-
creases transfer will increase until, when
S2 is identical to R1, maximum positive
transfer will obtain when R2 is identical
to S1 and maximum negative transfer will
occur when R2 and S1 are different and
neutral.

Fig. 1. Transfer as a function of S1-R2
and S2-R1 similarity.
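
The qualitative shape of the relabeled surface can be sketched numerically. The text fixes only the boundary behavior: no transfer when S2-R1 similarity is minimal, and, when S2 and R1 are identical, a range from maximum negative to maximum positive transfer as S1-R2 similarity goes from 0 to 1. The bilinear formula below is an assumption chosen only because it is the simplest function passing through those points; the function name and the scaling of transfer to the interval -1 to +1 are likewise hypothetical.

```python
def predicted_transfer(s1_r2: float, s2_r1: float) -> float:
    """Signed transfer on an arbitrary -1..+1 scale.

    s1_r2: similarity of first list stimuli to second list responses
    s2_r1: similarity of second list stimuli to first list responses
    Both run from 0 (neutral) to 1 (identical).

    Assumed bilinear form: transfer = s2_r1 * (2 * s1_r2 - 1).
    """
    for name, value in (("s1_r2", s1_r2), ("s2_r1", s2_r1)):
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must lie between 0 and 1, got {value}")
    return s2_r1 * (2.0 * s1_r2 - 1.0)


# Corner points of the surface under this assumed form:
a_b_then_b_a = predicted_transfer(1.0, 1.0)  # S1 = R2 and S2 = R1
a_b_then_b_c = predicted_transfer(0.0, 1.0)  # S2 = R1, S1 and R2 neutral
unrelated = predicted_transfer(0.7, 0.0)     # S2-R1 similarity minimal
```

Under this form the A-B, B-A arrangement falls at maximum positive transfer, the A-B, B-C arrangement at maximum negative transfer, and any arrangement with neutral S2-R1 similarity at zero. A different interpolation between the same corners would be equally compatible with the verbal description.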

The surface is not completely unsub-
stantiated. Although little attention has
been given to S1-R2 and S2-R1 transfer
effects per se, the verbal learning litera-
ture does contain some data which docu-
ment a number of points on the surface.
Murdock (1956) found that A-B learning
will facilitate subsequent B-A learning.
His experimental paradigm corresponded
to the point of maximum positive trans-
fer on the proposed surface where S1 is
identical to R2 and S2 is identical to R1.
Murdock explained this instance of posi-
tive transfer in terms of B-A strength es-
tablished during the A-B learning stage.
In another study Murdock (1958) has
confirmed an additional point on the
surface. He employed an A-B, B-C
paradigm and found a negative transfer
effect during second list learning. This
situation corresponds to the point of
maximum negative transfer where S2
and R1 are identical and R2 and S1 are
different and neutral with respect to each
other. Murdock proposed that the B-A
connection established during A-B learn-
ing will compete with the B-C learning
during the second stage.

An experiment by Young (1961), 
while not conforming to the usual two 
list transfer situation, produced some per- 
tinent data. He constructed lists such 
that all items served as both stimuli and 
responses and found that these lists were 
more difficult to learn than lists whose 
items did not serve in the double capacity. 
The proposed transfer surface would pre- 
dict this result. His lists were, in effect, 
composed of B-C pairs preceding A-B 
pairs and A-B pairs preceding B-C pairs. 
According to the surface the former se- 
quence of pairs should produce no trans- 
fer while the latter should produce 
negative transfer effects through the com- 
petition of the B-A associations with the 
B-C associations. Thus slow overall 
learning would have been predicted for 
the double-function list. 

Further documentation of the surface 
may be found in the field of verbal media- 
tion. Although not strictly applicable to 
the similarity surface contained in Figure 
1, as some of the relationships are asso- 
ciative, these data are at least suggestive 
of the kinds of results which might be 
obtained with similarity measures. 
Kjeldergaard and Horton (1962) em- 
ployed two mediation paradigms which 
may be located on the transfer surface. 
The first was a C-B, B-A, A-C paradigm. 
If one thinks of this situation as a two 
stage B-A, A-C transfer paradigm with 
the C-B list establishing a relationship (in
this case associative) between the first-
stage stimuli (B) and the second-stage
responses (C) then it corresponds to a
transfer situation with maximum S2-R1
similarity (A and A) and some positive
amount of S1-R2 relatedness (B-C). In
accordance with the transfer surface pre-
diction for such a paradigm (see * in
Figure 1) Kjeldergaard and Horton
found A-C facilitation or positive trans-
fer. The second relevant paradigm was
an A-B, C-B, A-C situation. Once again,
if A-B is interpreted as a definition of a
positive relationship between the first-
stage response (B) and the second-stage
stimulus (A) then the paradigm corre-
sponds to a transfer situation in which
S1-R2 (C-C) is maximum and S2-R1
(A-B) is some positive value (see ** in
Figure 1). Kjeldergaard and Horton re-
ported A-C facilitation. The transfer
surface predicts positive transfer if A-B 
strength is assumed to be of some minimal 
degree. 

Thus there is some evidence or support 
for a number of points on the surface. 
Of course, the majority of the relation- 
ships depicted by the surface have never 
been tested and require empirical verifi- 
cation, particularly under constant experi- 
mental conditions. 

The surface is intended as nothing 
more than a rough index of the relation- 
ships existing between transfer and S1-R2
and S2-R1 similarity. Presumably vary-
ing materials, experimenters, and situa- 
tions will produce variations in the shape 
of the surface. Qualitative as well as 
quantitative changes in the relationships 
between stimuli and responses should also 
influence the shape of the surface. Thus, 
formal similarity, judged similarity, asso- 
ciative strength, synonymity, and associa- 
tive overlap may each generate surfaces 
which, while conforming to the general 
characteristics of the Figure 1 surface, 
may differ from each other considerably. 

The proposed surface, as a statement of 
the relationships between transfer and 
two specific similarity loci, suggests a 
number of directions for further study. 

As mentioned, a good deal of experi-
mental verification of the relationship
expressed by the surface would be desir-
able. Comparisons of surfaces generated
by different relatedness indices would
greatly increase our knowledge of the
interactions between transfer and specific
measurement techniques. Finally, clari-
fication of the relationships between trans-
fer and similarity through an integration
of the relationships expressed by Osgood’s 
surface and those contained in the present 
model would seem to be of considerable 
importance. 

An increasing number of experiments have been concerned with vari-
ous aspects of reinforcing brain stimulation. The dramatic nature of
many of the findings, however, has often forced methodological and
interpretive problems into the background. The present paper dis-
cusses some of the problems encountered in: (a) measuring the rein-
forcement strength, (b) determining stimulus thresholds, (c) inter-
preting interactions between specific neural areas.


The discoveries that electrical stimu- 
lation of the brain may have positive 
and negative reinforcing properties 
have triggered experimentation and 
speculation which have been increas-
ing at a rapid rate. The possibility
that the road was now open to the dis- 
covery of basic physiological mecha- 
nisms of motivation and emotion 
aroused the active interest of psycholo- 
gists, physiologists, pharmacologists, 
anatomists, and others. Because of 
the interdisciplinary nature of much 
of the research and in part because 
the dramatic nature of the findings 
tended to force questions of method- 
ology into the background, many of the 
complexities of the methods adopted 
have not been fully appreciated. As
there have been a number of attempts
in recent years to integrate experimen-
tal findings into theoretical schemes, it
would seem appropriate at this time to
examine some of the problems of meas-
urement and interpretation of the re-
sults obtained with reinforcing brain
stimulation.

For a comprehensive review the reader
is referred to Olds, 1962.

In emphasizing the problems of 
measurement and the determination of 
what has been measured there is no in- 
tent to restrict exploration, but rather 
to locate buoys which point out where 
there is a risk of running aground. If 
more attention is not given to these 
problems there is the danger of being 
overwhelmed by an accumulation of 
anecdotal reports which will seriously 
impede progress in this field. 

Although there may be many ways 
of organizing a discussion of the prob- 
lems of measuring and interpreting 
the reinforcing consequences of brain 
stimulation, the major issues can be 
conveniently, if somewhat arbitrarily, 
grouped under three headings: (a) 
measurement of reinforcement strength,
(b) determining stimulus thresholds, 
and (c) interpreting interactions be- 
tween specific neural areas. 


MEASURING REINFORCEMENT 
STRENGTH 


Following the initial demonstration 
that animals would respond in order to 
obtain electrical stimulation of certain 
brain areas, it soon became apparent 
that methods to quantify the strength 
of the reinforcement were needed. 
The relative reinforcement strength
of stimulation of different neural
areas and estimates of change
in reinforcement value with different
stimulus parameters, drugs, ablations,
gonadectomy, and deprivation have
been the subjects of numerous studies
(see Olds, 1962). These studies all
require methods of determining change 
in the reinforcement properties of 
brain stimulation. 

The most widely used measure of 
the strength of a reinforcer is the rate 
of making some response (usually lever 
pressing) which is followed by the 
presentation of that reinforcer. In ad- 
dition to the convenience of this meas- 
ure there is much appeal in the argu- 
ment that the length of time permitted 
to elapse between responses reflects the 
intensity of the desire for the reward. 
There are difficulties, however, of both 
an empirical and logical nature. Em-
pirically, it has been shown that re-
sponse rate can be a misleading index
of reinforcement with brain stimulation 
as results may not agree with an ani- 
mal’s preference (Hodos & Valen- 
stein, 1962) or with measures of re- 
sistance to competition from other re- 
inforcers such as food and shock avoid- 
ance (Valenstein & Beer, 1962). At 
high intensities, for example, response 
rate usually declines because motoric 
side effects of the stimulation disrupt 
performance, but animals may choose 
those intensities over a lower ampli-
tude stimulation that supports a higher
response rate. 

Hawkins and Pliskoff (1964) have 
confirmed these findings recently with 
a new technique. These authors have 
employed a two-member behavioral 
chain in which responses on the first 
lever were reinforced on a variable- 
interval schedule with the insertion of 
a retractable lever. Responses on the 
retractable lever provided brain-stimu- 
lation trains on a continuous reinforce- 
ment schedule. A comparison of the 
rates on the two levers revealed that 
response rate on the first lever con-
tinued to increase at stimulus intensi-
ties higher than those which produced
peak response rates with continuous 
reinforcement. It was concluded that 
brain stimulation could not be assessed 
adequately by self-stimulation rates 
with continuous reinforcement.

Logically, there are also difficulties. 
When average response rate is used as 
a measure of the value of the reinforcer 
to the animal there exists an implicit 
assumption that reinforcement strength
is homogeneous throughout the testing
session. This assumption cannot al-
ways be met with brain stimulation as
some effect of the stimulus which per-
sists may change the value of the re-
inforcer after its administration. The
finding that seizure activity is fre-
quently associated with reinforcing
brain stimulation (Newman & Feld-
man, 1960; Porter, Conrad, & Brady,
1959) suggests that some aspect of the
stimulation may persist and obviate
the necessity to respond immediately.
Equally likely is the possibility that a
second stimulation may result in a
qualitatively different experience if
sufficient time has not elapsed.

Some data collected in our labora-
tory are pertinent to the point. It has
been observed repeatedly that animals
with septal electrodes characteristically
respond at a slower rate than
do those with electrodes in the pos-
terior hypothalamus. With the self-

such response rate differences are
equivocal. Some light was thrown on 
the question by a procedure in which 
the experimenter could vary the rate 
of stimulation (Valenstein & Meyers, 
1964). Rats were placed in a two- 
platform chamber in which brief trains 
of brain stimulation were presented 
when the animal was on the positive 
(stimulation) platform. As the posi- 
tive and neutral (no-stimulation) plat- 
forms were interchanged on a random 
schedule, the time spent receiving 
stimulation provided a useful measure 
of the animal’s orientation to the 
stimulus. 
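
The time-on-platform measure amounts to simple arithmetic, sketched below. The function name and the representation of a session as a list of (enter, leave) times on the positive platform are assumptions made for illustration; the original report does not specify how the records were kept.

```python
def percent_time_on_positive(visits, session_seconds):
    """Percentage of the session spent on the positive platform.

    visits: list of (enter, leave) times in seconds, one pair per
            stay on the positive platform (hypothetical record format).
    session_seconds: total length of the test session in seconds.
    """
    if session_seconds <= 0:
        raise ValueError("session_seconds must be positive")
    total = 0.0
    for enter, leave in visits:
        if not 0 <= enter <= leave <= session_seconds:
            raise ValueError(f"invalid visit: ({enter}, {leave})")
        total += leave - enter
    return 100.0 * total / session_seconds


# Hypothetical 120-second session with two stays on the positive side.
score = percent_time_on_positive([(0, 30), (60, 120)], 120)
```

Because the positive and neutral platforms are interchanged at random, a score near 50 would indicate no orientation toward the stimulus, and scores well above 50 would indicate that the animal follows the stimulation.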

Figure 1 presents the results of 
varying the stimulation rate to differ- 
ent reinforcing sites. Percentage of to- 
tal time which was spent on the posi-
tive platform is plotted as a function of
stimulus intensity. The 50% line rep- 
resents chance. With hypothalamic 
stimulation there was no tendency for 
the animal to leave the positive plat- 
form at the fastest stimulation rates; 
with septal stimulation, however, time
on the positive platform decreased 
with the shorter interstimulus inter- 
vals. The septal animals maximized 
their time on the positive side of the 
chamber when the stimulus was de- 
livered at a rate approximating that 
achieved through self-stimulation. 

Several conclusions may be drawn 
from these data, but what is most rele-
vant to this discussion is that animals
receiving septal stimulation respond 
more slowly than those receiving hy- 
pothalamic stimulation because faster 
stimulation is avoided. The value of 
the reinforcer to the animal apparently 
changes following stimulation. Esti- 
mates of reinforcement strength based 
only on response rate are not justified 
as it appears that with septal stimula- 
tion (and perhaps other reinforcing 
areas) effects of the stimulation may 
persist. 

A similar point may be made with 
other data from our laboratory. The 
lever-pressing rate for septal stimula- 
tion alone and when combined with 
food reward on two different sched- 
ules of reinforcement was obtained. 
Animals were food deprived and re- 
sponse rates were obtained independ- 
ently for septal stimulation on a con- 
tinuous reinforcement schedule and 
food reward available at variable in-
tervals. Then the two schedules were
combined so that responses on a single 
lever were reinforced continuously 
with brain stimulation and at variable 
intervals with food. If reinforcement 
strength alone determined rate, re- 
sponse frequency would have increased. 
Figure 2 shows that actually the rate 
was lower than that obtained with 
brain stimulation alone probably be- 
cause of loss of responding time dur- 
ing eating. A more striking demon- 
stration was obtained when the animal 
was placed on a schedule requiring 2 
responses for each reinforcement 
(fixed ratio). Although this schedule
generates a fast rate of responding, 
combining the food reward with brain 
stimulation resulted in a rate slower 
than that obtained with food reward 
alone (Fig. 2). The argument that 
brain stimulation may have decreased 
the value of the food reward is miti- 
gated by the observation that the ani- 
mal always ate the food when it was 
available. In view of the results ob- 
tained on the dual platform with dif- 
ferent stimulation rates, it would seem 
that septal stimulation exerted a brak- 
ing action on response rate because 
faster stimulation was unacceptable. 

An experiment in which septal stimu- 
lation was administered at each con- 
tact with a drinking tube containing 
glucose also supports this interpreta- 
tion (Asdourian, 1962). Changes from 
base-line data indicated that the addi- 
tion of septal stimulation reduced the 
amount of glucose consumed. It was 
concluded that a “limiting mechanism” 
was operating which held the number 
of stimulations (and necessarily con- 
tact with the glucose solution) to 
about the pretest bar-pressing rate for 
brain stimulation alone. It should be 
apparent that an appraisal of the mo- 
tivational consequences of brain stimu- 
lation by determining whether the ad- 
dition of this reinforcement increases 
or decreases response rate for food 
may be misleading (Nielson, Doty, & 
Rutledge, 1958). 

Brain stimulation may also have 
aversive consequences (Delgado, Rob- 
erts, & Miller, 1954), but different 
techniques are required for measuring 
the strength of the aversion. A rate 
measure as used with positive rein- 
forcers is not possible as animals will 
not respond for aversive stimulation. 

However, by changing the procedure 
so that stimulation is postponed for a 
fixed period of time following each re- 
sponse, an avoidance (or escape) rate 
may be obtained. To determine the 
significance of this response rate, it 
may be necessary to take into consid- 
eration the matter of acceptability of 
faster stimulation rates. In a recent 
study, stimulation was presented in 
trains of .5-second duration at the rate 
of one train per second and a lever 
press postponed stimulation for 4 sec- 
onds (Olds & Olds, 1963). Response 
rates to postpone stimulation indicated 
an avoidance tendency, while self- 
stimulation performance indicated the 
presence of positive reinforcement. 
With pure negative reinforcement the 
results were unambiguous, but with 
mixed approach-avoidance (ambiva- 
lent) effects interpretation was com- 
plicated. Some septal electrode place- 
ments, for example, were classified as 
ambivalent because the animals both 
self-stimulated and responded to post- 
pone stimulation. Considering the 
previous discussion it seems likely that 
with slower rates of stimulation these 
animals may not have terminated the 
stimulus. Olds and Olds were aware 
of this problem and wrote: 

In approach tests, the response rates of am- 
bivalent rats were never above one response 
every two seconds. Thus it appears that in 
this case, applying the stimulus too often 
has avoidance effects [p. 277]. 


Full appreciation of this problem places 
the “aversive” aspects of stimulation 
in the so-called ambivalent areas in a 
different perspective. Only stimula- 
tion at fast repetition rate is avoided. 
As we have little basis by which to 
evaluate the repetition rate of physio- 
logical stimulation, the motivational 
consequences of activation of such 
neural structures are questionable. 

It would appear that many of the 
difficulties of a rate measure would be 
eliminated by providing brain stimula- 
tion less frequently than with each re- 
sponse. With such methods, both the 
disruption of performance by motoric 
side effects and the preference for 
slower stimulation rates would have 
less influence on response rate. In- 
deed, the demonstration that intermit- 
tent brain stimulation was capable of 
maintaining behavior was accompanied 
by the suggestion that this would mini- 
mize the influence of gross motor ef- 
fects of the stimulus on the response 
rate (Sidman, Brady, Boren, & Con- 
rad, 1955). In view of this advantage 
of intermittent reinforcement sched- 
ules, it might seem strange that regular 
reinforcement continues to predomi- 
nate. A partial explanation may be 
found in the difficulty of maintaining 
the behavior of many animals with 
only intermittent brain stimulation. 
Perhaps comparable is the observation 
that some animals which press a lever 
repeatedly for continuous reinforce- 
ment do not perform adequately in a 
simple maze or runway where rein- 
forcement rate is less frequent (New- 
man, 1961; Olds, 1956; Spear, 1962). 
There may be a species difference in 
this respect, as monkeys have been 
shown to respond stably on fixed ratio 
schedules requiring 100 responses for 
one reinforcement (Brodie, Moreno, 
Malis, & Boren, 1960). In the past, 
however, brain-stimulation studies with 
rodents have used only schedules pro- 
viding a high reinforcement rate which 
is in striking contrast to the routine 
testing with ratios of 100 or more with 
food reinforcement (Ferster & Skin- 
ner, 1957). 

Recently, however, Pliskoff and Haw- 
kins (1964), using the technique de- 
scribed above, have been able to obtain 
stable data from rats with schedules 
providing a low frequency of rein- 
forcement. It will be recalled that 
these authors use a procedure in which 
animals are required to press one lever 
which is reinforced on some schedule 
with the insertion of a retractable lever. 
The animal may then receive a brain 
stimulation train for each response on 
the retractable lever up to some pre- 
determined number of presses before 
this lever is withdrawn. With this 
procedure it has proven possible to 
maintain behavior on the first lever 
with variable-interval reinforcement 
schedules up to VI 4 minutes and 
fixed ratio schedules up to FR 200. 
Although this method does not appear 
equally successful with all reinforcing 
neural sites, it does suggest that the 
response rates on the first lever may 
provide very useful information for as- 
sessing the value of the stimulation. 
Some caution in using response rate 
even with intermittent reinforcement 
should be observed. While response 
rate appears to reflect the incentive 
value of the stimulus, it may not simi- 
larly reflect internal states of the or- 
ganism which would be expected to in- 
teract with reinforcement value. With 
food reward, for example, it has been 
shown that response rate maintained 
by a variable-interval schedule of rein- 
forcement may be insensitive to depri- 
vation level. Also following repeated 
testing with fixed ratio schedules, re- 
inforcement may become associated not 
only with the response, but also with 
a particular rate of responding. Under 
these conditions, rate may become an 
essential part of the response and 
changes in reinforcement value may 
not be reflected in response rate (Sid- 
man, 1960). Response rate develops a 
tempo or rhythm and becomes increas- 
ingly stable, but at the same time it is 
likely to become less sensitive as a 
measure. The paradox is that until 
there is a satisfactory independent 
measure of reinforcement strength, it 
will be impossible to determine how 
faithfully response rate reflects rein- 
forcement value. With food reward, 
the assumption that reinforcement in- 
creases with deprivation or quantity of 
food has common sense appeal. With 
brain stimulation there are few con- 
venient guides as manifested by con- 
flicting opinion on the effect of in- 
creasing stimulus intensity on rein- 
forcement value (Hodos & Valenstein, 
1962; Reynolds, 1958). 

Tests which permit an animal to 
demonstrate a preference may provide 
a relative measure of reinforcement 
strength. These tests may consist only 
of presenting two reinforcing condi- 
tions simultaneously and recording 
some index of the animal's preference. 
Quantification of the strength of the 
preference may be accomplished with 
techniques which require that the ani- 
mal perform some work in order to 
change from the less preferred to the 
more preferred reinforcement. By ad- 
justing the amount of work to a point 
of equal preference, an estimate of the 
strength of preference may be achieved. 
Verhave (1963) has described some of 
the experimental variables and mathe- 
matical considerations applicable to 
this type of testing procedure when 
used with food reward. In general, 
however, preference tests tend to be- 
come cumbersome when a large num- 
ber of stimuli are to be compared. For 
most experimental purposes, it would 
be more convenient if reinforcement 
value could be expressed in terms of a 
metric unit that would indicate rela- 
tive position on a scale. Preference 
tests, however, may help to validate a 
method which does provide such infor- 
mation. 

Tests other than response rate may 
provide a useful unit of measure. One 
avenue which has been explored is 
based on a determination of how much 
an animal will overcome to obtain 
reward. Obstruction-box techniques 
(Warden, 1931) have been generally 
rejected because of the variability of 
behavior resulting from repeated elec- 
tric shock. Recently a test was de- 
scribed in which the number of lever 
presses required for successive rein- 
forcement increased by a fixed ratio 
(Hodos, 1961). The measure of re- 
ward strength resulting from such a 
“progressive ratio test” is the number 
of unreinforced responses an animal 
will make before the behavior is ex- 
tinguished. This technique appears to 
be useful with food reward, but only 
preliminary information exists with re- 
inforcing brain stimulation (Hodos, 
1963). The method has the advan- 
tage of eliminating the problems in- 
herent with a response-rate measure 
and also provides a unit useful for 
comparative purposes. It will be im- 
portant, however, to determine if all 
self-stimulating animals perform reli- 
ably under conditions which provide 
only intermittent reinforcement. 
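
As a rough illustration of how a breaking point might be scored from a progressive ratio session, the following Python sketch may be useful. It assumes, which the text does not specify, that the response requirement grows by a constant additive step and that extinction is declared at the first pause longer than a chosen criterion. The function name and the parameters `step` and `max_pause` are hypothetical.

```python
def breaking_point(press_times, step, max_pause):
    """Return the largest response requirement completed before extinction.

    press_times: lever-press times in seconds, in ascending order.
    step:        assumed additive increase in the requirement after each
                 reinforcement (first requirement is `step` presses).
    max_pause:   assumed extinction criterion; a gap between presses longer
                 than this ends the session.
    """
    required = step        # presses needed for the next reinforcement
    count = 0              # presses made toward the current requirement
    last_completed = 0     # largest requirement the animal has satisfied
    previous = None
    for t in press_times:
        if previous is not None and t - previous > max_pause:
            break          # pause too long: treat behavior as extinguished
        count += 1
        if count == required:
            last_completed = required
            required += step
            count = 0
        previous = t
    return last_completed


# Seven presses one second apart, then a long pause before the eighth.
print(breaking_point([0, 1, 2, 3, 4, 5, 6, 100], step=2, max_pause=15))  # 4
```

A score of this kind is a single number per session, which is what makes it convenient for comparing reinforcers; how the pause criterion is chosen would need to be settled empirically.
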
Another measurement problem about 
which much interest has centered con- 
cerns the changes in reinforcement 
value as a function of the duration of 
stimulation. When given control over 
stimulus duration, animals will termi- 
nate positive stimulation. This has 
been attributed to the excitation of a 
neighboring aversive neural system 
through a temporal summation of in- 
adequate stimuli (Stein, 1962a). The 
most frequently used method of ob- 
taining preferred duration permits the 
animal to hold a single lever down to 
obtain stimulation which is terminated 
when the lever is released (Bower & 
Miller, 1958; Stein, 1962a). This 
method yields very brief “preferred” 
durations, but it would appear that the 
influence of motoric side effects of the 
stimulation has not been considered 
sufficiently. Very different data are 
obtained when animals are not re- 
quired to hold the lever, but are free 
to move about the testing chamber 
until they press a second lever which 
terminates the stimulus. Direct ob- 
servation of the behavior suggests that 
the different results are due to the in- 
ability of the animals to inhibit motor 
reactions forcing them off the lever, 
which in the one-lever test terminates 
the stimulus. 

Fig. 3. Comparison of average duration 
animals maintain brain stimulation when ob- 
tained with one-lever (open circles) and 
two-lever (filled circles) techniques. 

Figure 3 contrasts the results from 
these two methods with four animals 
receiving reinforcing hypothalamic 
stimulation (Valenstein & Valenstein, 
1963). It is evident that the preferred 
durations obtained by the two meth- 
ods are of different orders of magni- 
tude and are not to be explained away 
by the fraction of a second necessary 
to traverse the short distance between 
the levers. The interpretation sug- 
gested by one set of results does not 
seem to be appropriate to the other. 
The notion of activation of an aversive 
system through temporal summation 
may be appropriate to durations in 
the order of 1 second, but would re- 
quire a considerable extension of the 
neurophysiological data to be applied 
to durations above 10 seconds. 


3 Temporal summation is generally ex- 
plained by the addition of excitatory post- 
synaptic potentials (EPSP). These poten- 
tials reach a peak in a few milliseconds and 
thereafter rapidly decay. Afferent sub- 
threshold volleys, for example, are unable 
to generate an impulse unless the interval 
between volleys is less than 5 milliseconds 
(Eccles, 1957). 

Speculation (Miller, 1957) that there 
exists a unitary neural system under- 
lying all reinforcement which is con- 
stantly being modulated by separate 
drive states has been the impetus for 
much research. Those neural areas 
which animals will stimulate them- 
selves are viewed as the best candi- 
dates for this unitary reinforcing sys- 
tem. The possibility that activation of 
drive states may alter the level of ex- 
citability of the neural tissue crucial to 
self-stimulation behavior is implicit in 
this theory. There is indeed some sup- 
port for this idea as it has been re- 
ported that ingestion of nutrients may 
change the activity of cells in hypo- 
thalamic nuclei (Anand, Chhina, & 
Singh, 1962; Anand, Dua, & Singh, 
1961). Food deprivation also has been 
shown to increase self-stimulation rate 
(Brady, Boren, Conrad, & Sidman, 
1957; Hodos & Valenstein, 1960; 
Olds, 1958), but these studies raise a 
methodological question that is basic 
to experiments of this design. Fig- 
ure 4 presents self-stimulation rates of 
animals when sated and hungry. It 
can be seen that there are striking dif- 
ferences under the two conditions, but 
the response rates are considerably be- 
low the level of 30 to 100 per minute 
commonly observed with reinforcing 
brain stimulation. It becomes neces- 
sary to ask whether the stimulation 
was actually reinforcing at the intensi- 
ties used to demonstrate these differ- 
ences. This question is especially criti- 
cal in view of the well-established re- 
lationship between food deprivation and 
activity level. Activity changes occur 
not only with food deprivation, but 
with the administration of depressant 
and excitant drugs as well as with 
stimulation of some neural structures. 
For example, recently stimulation of 
the caudate nucleus of the cat (a sug- 
gested reinforcing site) has been shown 
to produce hyperactivity which with 
some testing procedures may be con- 
fused with reinforcement (Justesen, 
Sharp, & Porter, 1963). Before a rate 
change can be attributed to any direct 
effect on the neural substrate of re- 
ward, the influence of general activity 
level on performance must be parceled 
out as it has been shown that even 
in the absence of stimulation, food-de- 
prived animals respond at significantly 
higher rates than sated animals (Hodos 
& Valenstein, 1960). 


THRESHOLD DETERMINATION WITH 
REINFORCING BRAIN 
STIMULATION 


There are several reasons why the 
determination of a stimulus intensity 
threshold is important with brain- 
stimulation experiments. Self-stimula- 
tion behavior would be expected to be 
most sensitive to influence at low in- 
tensities. Presumably, this is true be- 
cause changes in the excitability of the 
nervous tissue are likely to have sig- 
nificant influence only if the stimula- 
tion intensity is not so high that it 
overcomes all differences in nerve cell 
responsiveness. The general accept- 
ance of this point of view can be seen 
in Stein's (1962c) statements, “drugs 
effective against depression should in- 
crease the ability of the brain to re- 
spond to positive reinforcement, either 
by directly stimulating or sensitizing 
reward centers, or by selectively in- 
hibiting aversion centers [p. 298];” 
and “It would be hard to exaggerate 
the importance of using threshold- 
intensity currents for these tests Ip. 
” 


Threshold determination is also im- 
portant for charting the reinforcing 
consequences of stimulation of different 
neural areas as it is reasonable to as- 
sume that the threshold reflects prox- 
imity to the reinforcing site. In addi- 
tion, the threshold intensity provides 
some indication of the relative size of 
the neural field which must be excited 
in order to achieve effects. 

Actually the stimulus threshold has 
several possible meanings and it is es- 
sential that these different meanings 
be kept separate. Proceeding from the 
low end of the intensity scale upward 
we may speak of the cue or detection 
property of the stimulus, and the low- 
est intensity which can be sensed, di- 
rectly or via some mediating process, 
may be called the detection threshold. 
The animal perceives the stimulus and 
can be trained to respond to the brain 
stimulation as to any other discrimi- 
nable stimulus, but prior to learning, 
the stimulus is neutral and possesses 
no special reinforcing consequences. 
At higher intensities we may speak of 
a reinforcement threshold, which is the 
minimum intensity capable of main- 
taining some instrumental behavior to 
obtain or escape from the stimulus. 
With appropriate techniques it is pos- 
sible also to determine intensity pref- 
erences. This is the intensity the ani- 
mal selects with a procedure permit- 
ting self-regulation. In addition, there 
are a number of other kinds of reac- 
tions associated with stimulation for 
which thresholds may be determined. 
These include stereotype motor reac- 
tions, the occurrence of convulsions 
and physiological responses such as 
cardiovascular changes. An independ- 
ent variable may affect one threshold 
but not others, and unless the factors 
controlling behavior in a particular 
testing situation are determined, con- 
clusions may be misleading. 

The evidence that there is a range 
of intensities below the reinforcement 
value that may be detected comes from 
several directions. It was noted in our 
laboratory that when the intensity was 
switched to a low value, experienced 
rats and guinea pigs stopped respond- 
ing after only one exposure to the 
stimulus. In contrast, if the stimula- 
tor was turned off, the animal typi- 
cally responded 15 or more times be- 
fore there was any interruption of a 
response rate. Evidently the low in- 
tensity stimulation was serving as a 
cue or signal for the beginning of a 
nonreward period. With no stimula- 
tion at all, the situation was more am- 
biguous. Recently, the quantitative re- 
lationship between the detection and 
reinforcement thresholds has been in- 
vestigated by using intensities too low 
to maintain self-stimulation to signal 
the availability of reinforcing stimulus 
intensities (Campbell, 1963). 
Stimulation intensities in the rein- 
forcement range also have cue proper- 
ties. This was demonstrated in an ex- 
periment in which positive stimulation 
served as a conditioned stimulus for 
an avoidance response (Mogenson & 
Morrison, 1962). Results with avoid- 
ance conditioning also suggest that 
structure provides a distinctive cue to 
the animal regardless of any other ef- 
fects associated with the stimulus 
(Nielson, Knight, & Porter, 1962). 
The distinction between detection 
and reward threshold is equally im- 
portant with aversive brain stimula- 
tion. In determining the threshold in- 
tensity for an aversive stimulus it was 
noted that the subjects (monkeys) 
were responding to the cue proper- 
ties of the stimulus (Boren & Malis, 
1961). In this testing procedure, an 
animal’s response reduced the intensity 
of an aversive stimulus that was other- 
wise increasing at the rate of one step 
each second. The monkeys started re- 
sponding when the stimulus reached a 
certain intensity and thereby prevented 
the stimulus from increasing above 
that point. It was first thought that 
this point indicated the aversive-stimu- 
lus threshold, but with additional ex- 
perimentation it was recognized that 
as the intensity gradually increased, 
nonaversive stimuli were playing the 
role of signals which warned the ani- 
mal of the forthcoming aversive stimuli. 
That animals were responding to the 
warning signal rather than to the 
aversive stimulation was shown by de- 
creasing the maximal current. In this 
way it was possible to extinguish re- 
sponding to the warning signals by 
separating them from the truly aver- 
sive stimuli. 

It can be seen that any procedure 
which employs a regular order of pre- 
senting stimulus intensities may dis- 
tort the reinforcement threshold. Esti- 
mates of the reinforcement threshold 
based on response rate are particu- 
larly likely to be distorted when the 
stimulus intensities are presented in a 
systematic pattern. With an ascend- 
ing order, response rate may increase 
when the stimulus is detected because 
this serves as a signal for forthcoming 
reinforcement. This is similar to the 
well-established finding that with fixed 
intervals between reinforcements the 
response curve is “scalloped” (J func- 
tion) due to the acceleration of rate 
as the reinforcement time approaches 
(Ferster & Skinner, 1957). Response 
rate in the range of intensities between 
the detection and reinforcement thresh- 
olds may reflect anticipation of rein- 
forcement, but this is not equivalent to 
a response to a reinforcing stimulus. 
Anticipatory behavior may reflect gen- 
eral activity, as suggested above, and 
may also be influenced by a change in 
sensation threshold. As little is known 
concerning the mechanism by which 
animals detect the presence of the 
stimulus, it is evident that caution 
should be exercised before attributing 
a change in rate to a change in sensi- 
tivity of “reward” or “aversion” cen- 
ters. 

In contrast to a systematic sequence, 
a random sequence of intensity pres- 
entation yields data that are quite vari- 
able, as response rate produced by a 
given stimulus intensity is influenced 
by the intensity of the preceding stimu- 
lus. If the preceding intensity is high 
the response rate is likely to be sig- 
nificantly lower than that which would 
have been obtained with less intense 
preceding stimuli. Some preliminary 
data from our laboratory indicate that 
such effects may persist for as long as 
30 minutes. It is not clear whether 
these results are due to an emotional 
reaction to the contrast in reinforce- 
ment strength or to some change in 
the responsiveness of the neural tissue 
following intense stimulation. We will 
raise this problem again, but at this 
point we would indicate that while a 
random presentation of stimulus in- 
tensities eliminates anticipatory be- 
havior, it has the drawback of increas- 
ing variability. 

Many of the problems discussed are 
exaggerated by a testing procedure 
which presents a series of intensities 
during a single testing session. The 
influence of both the cue properties of 
the stimulus and contrast effects due 
to differences in reinforcement strength 
are minimized when each session of- 
fers only one stimulus condition. Even 
under these conditions, however, it is 
difficult to make precise estimates of 
the reinforcement stimulus threshold 
from curves depicting rate as a func- 
tion of stimulus intensity. The main 
problem is that criteria for distinguish- 
ing between sub- and suprathreshold 
performance are difficult to establish. 

Figure 5 illustrates this difficulty 
and also compares results based on re- 
sponse rate with a time measure ob- 
tained from the two-platform test 
previously described. The similarity 
of the curves obtained with the two 
techniques supports the position that 
the same phenomenon was measured. 
However, it is instructive to compare 
the relative ease of determining the 
reinforcing stimulus threshold. With 
the two-platform test the positive 
(stimulation) and neutral platforms 
are switched on a random sequence 
throughout each test. As 50% of the 
total testing time on the positive plat- 
form represents chance expectancy, 
performance deviating from chance 
may be evaluated statistically. With 
self-stimulation rate there is no con- 
venient standard against which to 
evaluate a specific rate. Response lev- 
els prior to stimulation experience 
(operant behavior) do not provide a 
useful criterion as it has been shown, 
for example, that even with one food- 
reinforced session, rates may consist- 
ently remain above previous operant 
levels (Segal, 1962). Also, after ex- 
perience with reinforcing brain stimu- 
lation, we have noted that response 
rate remains significantly above “op- 
erant levels” during extinction trials. 
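
To make the idea of evaluating two-platform performance against the 50% chance expectancy concrete, a minimal Python sketch follows. It is an illustration only and not the analysis used in the studies cited: it assumes each test is summarized as the proportion of time spent on the positive platform, that tests can be treated as independent, and it substitutes a simple one-sided sign test. The function name is hypothetical.

```python
from math import comb


def sign_test_above_chance(proportions, chance=0.5):
    """One-sided exact sign test that time on the positive platform exceeds chance.

    proportions: per-test fraction of time spent on the positive platform.
    Returns (tests above chance, tests counted, one-sided p value).
    Tests exactly at chance are dropped, as is usual for a sign test.
    """
    above = sum(p > chance for p in proportions)
    below = sum(p < chance for p in proportions)
    n = above + below
    # Probability of `above` or more successes in n fair coin tosses.
    p_value = sum(comb(n, k) for k in range(above, n + 1)) / 2 ** n
    return above, n, p_value


# Six tests, all above 50% of the time on the positive platform.
above, n, p_value = sign_test_above_chance([0.70, 0.80, 0.65, 0.90, 0.75, 0.60])
print(above, n, p_value)  # 6 6 0.015625
```

No comparable calculation is available for a raw self-stimulation rate, because there is no built-in chance level against which the rate can be compared.
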

Figure 6 illustrates that the re- 
sponse rate during the first two ex- 
tinction tests averaged over 500% of 
the previous operant level. By the 
tenth test the rate had appeared to 
stabilize at approximately 200% of op- 
erant level. On the eleventh trial, the 
introduction of a 5-minute period of 
brain stimulation at the end of the 20- 
minute extinction session caused the 
rate to rise even in the nonstimula- 
tion period. Rate gradually declined 
with successive tests presumably as 
the animal learned that reinforcement 
was available only after 20 minutes. It 
would be anticipated that if reinforc- 
ing stimulation were introduced at 
random intervals, the rate during the 
extinction periods would have re- 
mained even higher. 

This tendency toward higher re- 
sponse levels following a history of 
association with reinforcement illus- 
trates the difficulty in estimating the 
reinforcement threshold precisely with 
a rate measure. It may be possible to 
obtain a daily operant level in order to 
compensate for this tendency toward 
increased response rates. For some 
purposes this method may be adequate, 
but in actual practice it will often be 
difficult (without an extensive investi- 
gation) to decide whether operant be- 
havior should be sampled before, dur- 
ing, or after each test session. 

After the animal has become familiar 
with the procedure, the reinforcement 
threshold obtained with the two-plat- 
form technique does not change with 
additional experience (Valenstein & 
Meyers, 1964), but there remains a 
possibility of an interaction between the 
cue and reinforcement properties of 
the stimulus. Animals may seek out 
stimulation at subreinforcement intensi- 
ties as a result of a generalization gradi- 
ent based on the cue properties. This is 
an empirical question which may be an- 
swered by offering animals stimulation 
only at the questionable intensities 
without any additional experience with 
higher intensities. Under these condi- 
tions extinction would be expected if 
stimulation was not reinforcing in its 
own right. 

Other methods have been used to 
assay the excitability of the neural sub- 
strate underlying reinforcement. Most 
noteworthy are two interesting pro- 
cedures which permit animals to self- 
regulate the intensity. The original 
technique introduced a method for de- 
termining the preferred intensity of 
stimulation (Stein & Ray, 1959). Ani- 
mals were trained to press either of 
two levers to receive a brief reinforc- 
ing stimulus, but responses on one 
lever produced stepwise increments in 
intensity, while the other lever pro- 
duced equal decrements in intensity. 
Current levels were started at zero 
and a trained animal increased the 
current level by responding on the ap- 
propriate lever and then, by alternating 
between levers, maintained the inten- 
sity around a “preferred” level. This 
technique was apparently abandoned 
because many animals did not provide 
stable data. Current regulation was 
more reliable with posterior (hypo- 
thalamus and midbrain tegmentum) 
electrode placements, while rostral 
sites (septal area) produced poor 
regulation as animals often increased 
the intensity until convulsions oc- 
curred. In spite of this shortcoming, 
several interesting facts emerged. Ac- 
cording to Stein and Ray (1959), pre- 
ferred levels, for example, were usu- 
ally higher than “experimenters would 
care to assign under the conventional 
fixed-intensity procedure. Exagger- 
ated and even violent motor activity 
was often produced by the intensities 
selected . . . [p. 571].” Motor activ- 
ity at the preferred intensity would be 
expected to depress self-stimulation 
rate. This finding further questions 
the validity of a rate measure and sup- 
ports the observation that animals 
often prefer very high stimulus inten- 
sities over more moderate intensities 
which may produce higher rates of re- 
sponding. 

The second self-regulatory technique 
has been explored more extensively 
(Stein & Ray, 1960) and has yielded 
more stable results and considerably 
more information particularly with re- 
spect to the action of drugs. This 
method also employs two levers, but 
brain stimulation is received as a con- 
sequence of pressing only one of them. 
With each successive stimulus presen- 
tation, the intensity is decreased in 
small steps, but at any time the ani- 
mal may reset the intensity to the ini- 
tial high level by pressing the “reset” 
lever. 
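
The logic of this resetting procedure can be sketched in a few lines of Python. The sketch is a simplified illustration under stated assumptions: the event labels `"stim"` and `"reset"`, the function name, and the integer intensity units are hypothetical, and the resetting intensity is taken here to be the intensity of the last stimulus the animal received before pressing the reset lever.

```python
def resetting_intensities(events, initial, step):
    """Return the intensity of the last stimulus received before each reset.

    events:  sequence of "stim" (stimulation lever) and "reset" (reset lever).
    initial: starting intensity, restored by every reset (arbitrary units).
    step:    assumed size of the decrement applied after each stimulus.
    """
    next_intensity = initial
    last_delivered = None      # no stimulus received since the last reset
    resets = []
    for event in events:
        if event == "stim":
            last_delivered = next_intensity
            next_intensity = max(next_intensity - step, 0)
        elif event == "reset":
            if last_delivered is not None:
                resets.append(last_delivered)
            next_intensity = initial
            last_delivered = None
        else:
            raise ValueError(f"unknown event: {event!r}")
    return resets


session = ["stim"] * 3 + ["reset"] + ["stim"] * 2 + ["reset"]
print(resetting_intensities(session, initial=100, step=10))  # [80, 90]
```

The list of resetting intensities, rather than the press rate, is the dependent measure; any variable that changes how many stimulation presses the animal emits before resetting will shift these values.
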

The step at which the animal resets 
the intensity tends to be reliable and 
sensitive to stimulus parameters and 
administration of drugs. It does not 
represent, however, the lowest inten- 
sity that will provide reinforcement 
(Stein, 1961). Figure 7, for example, 
illustrates that high self-stimulation 
rates may be obtained at resetting in- 
tensities. One possible explanation is 
that the cue properties of the stimulus 
enable the animal to “anticipate” a 
drop in reinforcement value. Several 
alternative explanations are possible, 
but it is important to stress that this 
technique yields a “threshold” that is 
significantly higher than the reinforce- 
ment threshold as it is commonly de- 
fined. 

There is one shortcoming of this 
procedure and one major question of 
interpretation which should be consid- 
ered in view of the quantity of data 
derived from this approach. As with 
the self-regulatory method for prefer- 
ence threshold, this procedure does 
not appear to be useful with all rein- 
forcing sites. To date, published stud- 
ies include only electrode placements 
in the hypothalamus and tegmentum. 
In our laboratory attempts to obtain 
reliable data with reinforcing telen- 
cephalic sites have been unsuccessful. 
With septal placements, for example, 
animals tend to press more and more 
slowly with decreases in current level 
and often self-stimulation behavior is 
extinguished before the intensity is re- 
set. This may be explained by the rela- 
tively rapid extinction reported with 
septal stimulation (Seward, Uyeda, & 
Olds, 1959), but the inability to use 
this procedure with many reinforcing 
areas limits its value particularly as a 
method for locating action sites of 
drugs. 

Interpretively, changes in resetting 
level following administration of drugs 
have been attributed to either changes 
in excitability of the specific brain 
structures stimulated by the electrode 
or more conservatively to a modifica- 
tion of a positive reinforcing system. 
In order to understand any change 
brought about through the introduc- 
tion of an experimental variable, it is 
important to determine what factors 
control the behavior in the test situa- 
tion under analysis. It has been shown 
that the resetting intensity is sensitive 
to stimulus parameters, and even shifts 
from stimulation rates of 25 to 33 cycles 
per second may be discriminated (Stein 
& Ray, 1960). However, other varia- 
bles in addition to the stimulus param- 
eter-neural excitability dimension may 
influence behavior in this situation. 

In a test requiring repetitive acts, 
behavior generally develops a rhythmic 
pattern. In this particular test, for 
example, it may be observed that ani- 
mals emit a regular number of re- 
sponses on the stimulation lever, then 
press the reset lever and repeat this 
sequence over and over again. Any 
experimental variable that modifies 
this pattern will have an effect on the 
resetting intensity. Drugs such as am- 
phetamine, pentobarbital, chlorproma- 
zine, and reserpine which have been 
shown to alter resetting intensity have 
a number of both central and periph- 
eral effects which may modify this be- 
havior pattern other than through a 
presumed sensitization or depression 
of a reward system. The finding that 
amphetamine lowers the resetting in- 
tensity, for example, is viewed as re- 
sulting from facilitation of the hypo- 
thalamic reinforcing system (Stein, 
1962b). The possibility that ampheta- 
mine induced a nonspecific motor ac- 
tivation was evaluated by testing ani- 
mals in a conventional lever-press- 
ing situation. Amphetamine increased 
response rates to a “subthreshold” 
stimulus, but there was no increase in 
rate if the stimulus was not presented
(Stein, 1964a). Nevertheless, the con-
clusion that amphetamine affects the
“reinforcing system” may be prema-
ture. Uyeda and Fuster (1962), for
example, have tested the tachistoscopic 
performance of monkeys and con- 
cluded that amphetamine improved ac- 
curacy and shortened reaction time. 
As similar results were obtained from 
electrical stimulation of the mesen- 
cephalic reticular formation the au- 
thors conclude that amphetamine has 
a reticulotrophic action. These results 
raise the possibility that it is the per- 
ception of subthreshold stimuli which 
may be influenced rather than the re-
inforcement process. In any case, a
comparison of the conclusions from
the two sets of experiments indicates
the danger of attributing changes in
performance to any specific or as-
sumed neural system.

Amphetamine has been shown to in- 
crease the tendency to respond in 
many different experimental situations 
with both positive and negative rein- 
forcement (see Carlton, 1963). In one 
illustrative experiment (Carlton, 1961) 
rats were required to alternate be- 
tween two levers in order to obtain 
food reward. Following administra-
tion of amphetamine, “perseverative”
tendencies were enhanced. That is,
animals tended to repeat responses on
the same lever although they received 
no food unless they alternated. With 
the self-determination method of ob- 
taining brain-stimulation threshold ani- 
mals were required to switch from one 
lever to the other in order to reset the 
stimulus intensity. Any “persevera-
tive” tendency or enhancement of re-
sponding would cause the animal to
drive the intensity down to a lower
level. It would appear gratuitous at
this time to attribute any change in
resetting level to any alteration of
specific neural elements. 
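
The confound described above can be made concrete with a small sketch. It assumes, purely for illustration, that each press on the stimulation lever lowers the current by a fixed step and that the reset lever restores the starting level; the function name, the microampere values, and the press counts are hypothetical and are not taken from the studies cited.

```python
def resetting_intensity(start_ua, step_ua, presses_before_reset):
    """Current level at which the animal resets, assuming a fixed-step decrement.

    start_ua: intensity restored by the reset lever (hypothetical value)
    step_ua: decrement produced by each stimulation-lever press (hypothetical)
    presses_before_reset: responses emitted before switching levers
    """
    return max(0.0, start_ua - step_ua * presses_before_reset)


# Same reinforcement system, different response pattern: a drug that merely
# lengthens the run of presses (perseveration) lowers the resetting level.
baseline = resetting_intensity(start_ua=100.0, step_ua=5.0, presses_before_reset=8)
perseverating = resetting_intensity(start_ua=100.0, step_ua=5.0, presses_before_reset=12)
print(baseline, perseverating)
```

Under these assumptions the lower resetting level follows from the longer run of responses alone, with no change in any parameter standing for reward sensitivity.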


NEURAL INTERACTION AND REIN- 
FORCING BRAIN STIMULATION 


Limbic neural pathways have been 
extensively described, but the determi- 
nation of the behavioral significance of 
these pathways lies outside the neuro- 
anatomical discipline. It is necessary 
to have a behavioral measure which 
can be attributed to some neural struc- 
ture in order to evaluate the nature 
of the interaction between structures. 
The self-stimulation phenomenon ap- 
pears to be particularly convenient for 
this type of analysis as, at least as a 
first approximation, behavior may be 
considered to be under the control of 
the neural structures surrounding the 
electrode tip. 

Through activation (stimulation) or 
deactivation (ablation or anesthetiza- 
tion) of one of two connecting struc- 
tures, an indication of the interaction 
between these structures should be re- 
vealed. Using this basic approach 
some interesting interaction effects be- 
tween specific limbic systems have 
been demonstrated, but the significance 
of the interaction is not always clear. 
For example, stimulation of the cau- 
date nucleus of a monkey produced a 
high and stable rate of bar pressing at 
the start of a session. However, if 
stimulation of either the hypothalamus 
or amygdala preceded the caudate 
stimulation, lever pressing was signifi- 
cantly lower and less stable (Brady, 
1961). To interpret such data we 
must ask whether these findings re-
sult from a specific interaction be-
tween these nuclei. With food rein-
forcement, for example, it has been
observed that when an animal “antici- 
pates” a particular reward a change in 
reward value will result in an eleva-
tion or depression of activity depend-
ent upon the relationship of the two
rewards. It has been suggested that
this may result from an emotional re- 
action to the contrast (Crespi, 1944). 
It is not unreasonable to expect a 
similar effect with contrasting rein- 
forcing brain stimulation, and indeed 
Hawkins and Pliskoff (1964) have 
reported such effects. Presumably 
contrast effects also have a neurologi- 
cal basis, but until the mechanisms of 
emotional reactions are better under- 
stood, interpretations which imply spe- 
cific interactions between brain sites 
must be regarded cautiously. 

Lesion technique may be used to de- 
termine whether a reinforcing site may 
be modified by eliminating the influ- 
ence of another area. In one experi- 
ment the dependence of reinforcing 
tegmental stimulation on any crucial 
activity of the septal area or fornix 
system was tested (Ward, 1960). It 
was concluded that there was no such 
dependence, as rats with large lesions 
in the septal area continued to self-
stimulate for tegmental stimuli. Pro-
viding the possibility of threshold
change is examined, it would seem
safe to draw conclusions from such 
“negative” results. However, where 
self-stimulation behavior is eliminated 
or decreased, it is necessary to deter- 
mine whether the reinforcing proper- 
ties of stimulation or some perform- 
ance capacity has been modified. In 
a recent report it was shown that there 
were deficits in bar pressing for cingu- 
late stimulation following hypothalamic 
lesions (Coons & Fonberg, 1963). 
The conclusion that the hypothalamus 
mediated reward obtained from cingu- 
late stimulation must be evaluated to- 
gether with the possibility that hypo- 
thalamic lesions affected performance 
sufficiently to cause a decrease in 
lever-pressing rate. As a minimum 
control it would be desirable to have 
a measure of the constancy of per- 
formance maintained by a reinforcer 
independent of brain stimulation. 

Where results suggest an inhibitory 
action of one area on another it is also 
important to determine whether this 
action is specific or of a general na- 
ture. An example may serve to illus- 
trate this point. There is a consider- 
able body of information supporting 
the view that feeding behavior is con- 
trolled by the interaction of a lateral 
hypothalamic “feeding center” and a 
medial hypothalamic “satiety center” 
(Anand & Brobeck, 1951). As stimu- 
lation in the lateral area also provides 
positive reinforcement while stimula- 
tion in the medial area appears to be 
aversive, it has been suggested that 
there may exist a similar interaction of 
reinforcing systems (Hoebel & Teitel- 
baum, 1962). Indeed, it was demon- 
strated that when mild ventral stimu- 
lation was superimposed upon stimu- 
lation of the lateral area the animals 
stopped lever pressing. However, as 
medial stimulation is generally consid- 
ered to have some aversive properties 
it is not surprising that the addition of 
a negative reinforcement should result 
in a lowering of response rate. It 
would be important to determine the 
influence of medial stimulation on 
other reinforcing brain sites (as well 
as with positive reinforcers other than 
brain stimulation) before any direct 
relationship is inferred. 

Although studies of neural interac-
tion that use electrical stimulation in
contrast to lesion methods have the 
advantage of reversible effects, there 
are technical problems which raise 
some questions about interpretation. 
When knowledge has been obtained 
about the response of an animal to 
stimulation of a given neural area 
(Site A) there may be interest in 
changes in response characteristics re- 
sulting from stimulating a second 
neural area (Site B). It is hoped that 
some information pertinent to the na- 
ture of the neural interaction may be 
inferred by simultaneously stimulating 
the two areas. Not often considered is 
the possibility that the interaction may 
be electrical rather than neural. We 
have noted, for example, that effects 
characteristic of Site A (e.g., stereo- 
type motor responses) may be trig- 
gered by stimulating Site B if a low, 
subthreshold stimulus is also presented 
to Site A. This may be true even 
when the stimulators are electrically 
isolated from one another and the elec- 
trodes located some distance apart. 
The fact that this effect may be pro- 
duced when a number of different 
areas are substituted for Site B sug- 
gests that these results should not be 
attributed to a neural interaction. We 
do not at present understand the 
mechanism involved, but could sug- 
gest the possibility that overlapping 
electrical fields which are ineffective 
by themselves may interact in one of 
several ways to produce an observable 
result. In any case the problem may 
be circumvented. In our laboratory 
we are using an electronic gating cir-
cuit which rapidly opens and closes
each electrode-stimulator pathway to
prevent nonneural interactions. Fig-
ure 8 illustrates this method of “simul-
taneously” stimulating two sites. Both
Stimulation A and B consist of bi- 
phasic pulse pairs presented at the 
rate of 100 per second, but each cir-
cuit is opened only during the presen-
tation of a pulse pair. Other solutions 
are possible, but when simultaneous 
stimulation is used to study neural in- 
teraction, awareness of the possibili- 
ties of nonphysiological interactions 
should be indicated. 
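
The timing logic of such a gating arrangement may be sketched as follows. Only the rate of 100 pulse pairs per second comes from the text; the 1-msec pulse-pair duration, the half-period offset between sites, and the function names are assumptions made for illustration, not a description of the actual circuit.

```python
def gate_windows(rate_hz, pulse_pair_ms, offset_ms, n_pulses):
    """Return (open, close) times in msec during which a site's gate is open."""
    period_ms = 1000.0 / rate_hz
    return [
        (offset_ms + i * period_ms, offset_ms + i * period_ms + pulse_pair_ms)
        for i in range(n_pulses)
    ]


def any_overlap(windows_a, windows_b):
    """True if any gate-open window for Site A overlaps one for Site B."""
    return any(sa < eb and sb < ea for sa, ea in windows_a for sb, eb in windows_b)


# Assumed values: 1-msec pulse pairs, Site B delayed by half a period (5 msec).
site_a = gate_windows(rate_hz=100, pulse_pair_ms=1.0, offset_ms=0.0, n_pulses=50)
site_b = gate_windows(rate_hz=100, pulse_pair_ms=1.0, offset_ms=5.0, n_pulses=50)
print(any_overlap(site_a, site_b))

# With too small an offset the two pathways would be open at the same time.
site_b_bad = gate_windows(rate_hz=100, pulse_pair_ms=1.0, offset_ms=0.5, n_pulses=50)
print(any_overlap(site_a, site_b_bad))
```

Because the two pathways are never open together under the assumed offset, overlapping electrical fields cannot sum, while at the behavioral time scale the two trains remain effectively simultaneous.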

In addition to investigation of inter- 
action between specific neural struc- 
tures, other studies have dealt with the 
interaction between systems. Recently 
suggestions have appeared in the lit- 
erature that an aversive-reinforcing 
system may inhibit the positive rein- 
forcing system. In one experiment, 
rats placed in a shuttle box were per- 
mitted 7.5 seconds after the presenta- 
tion of a tone to avoid aversive brain 
stimulation by crossing to the opposite 
side of the box (Stein, 1964b). Ani- 
mals were slow to learn the avoidance 
response under these conditions and 
typically waited until receiving the 
first stimulation before leaping to es- 
cape. However, if a “priming” stimu- 
lation of a positive area was presented 
with the onset of the warning tone, the 
number of successful avoidance re- 
sponses increased significantly. 

How can these interesting results be 
interpreted? One way is to postulate 
an interaction between the positive and 
aversive neural systems. Such an in- 
teraction has been hypothesized by 
Olds and Olds (1962) with reference 
to positive lateral hypothalamic and 
aversive tegmental stimulation: 


the termination of the tegmental stimulus 
will result in a release phenomenon in the 
lateral hypothalamic area. That is, there 
will be an augmentation of neural activity 
in the lateral hypothalamus on termination 
of the supposed negative reinforcing stimu- 
lus. Now this augmentation may be the 
sole prerequisite of positive reinforcement 
[p. 809]. 


According to this theory, avoidance or 
escape behavior is not reinforced di-
rectly by the elimination of the noxious [...]

With respect to the performance of [...]
in the shuttle-box situation described
above, it has been suggested that
the priming positive stimulation [...]
the threshold in the hypothalamic [...]
hypothalamic system.
An alternative explanation emerges 
from an examination of the somatic 
concomitants of the stimulus. It can [...]
appears to activate a “freezing” or back-
ward movement. [...]
competing responses (motor inhibition
and facilitation) by stimulation of dif- 
ferent limbic structures has been the 
subject of a number of studies (see 
Kaada, 1951). Behavioral implications 
of these findings have been studied by 
McCleary (1961). Where the envi- 
ronment is sensed by cephalad recep- 
tors, the evolutionary survival value 
of such behavior patterns would tend 
to result in their perpetuation. Nox- 
ious stimuli signaling danger elicit 
withdrawal patterns; positive stimuli
elicit approach patterns. 

We are reminded of Guthrie's (1935) 
advice: “To train a dog to jump
through a hoop, the effectiveness of 
punishment depends on where it is ap- 
plied, front or rear [p. 160].” Re- 
cently, it was demonstrated that run- 
way performance could be either fa- 
cilitated or inhibited depending upon 
whether shock was delivered to the 
hind or forepaws (Fowler & Miller, 
1963). It has already been noted that 
recent evidence has suggested that in- 
teroceptive as well as exteroceptive 
stimulation can directly determine re- 
sponse characteristics. 

The motor patterns which are trig- 
gered by positive and negative stimu- 
lation are antagonistic. The warning 
tone (conditioned aversive stimulus) 
presented to the rat in the shuttle box 
elicits motor patterns which interfere 
with the forward movement necessary 
for successful avoidance. The “prim- 
ing,” positive stimulation initiates a 
forward movement, which improves 
avoidance performance in this situa- 
tion. In agreement with this view is 
the observation that the positive stimu- 
lus antagonized the “freezing” reac- 
tion to the warning signal and per- 
mitted the avoidance response to get 
started (Stein, 1964b). 

It is true that if the motor responses 
are antagonistic in a sense the under- 
lying neural process may also be 
viewed as antagonistic. There is, 
however, an important distinction to 
be borne in mind. The antagonism of 
motoric reactions simply recognizes the 
fact that an animal can not be simul- 
taneously immobilized and moving 
forward. This does not necessitate
any direct interaction between central
neural processes. Explanatory theo-
ries should take into consideration
other alternatives as well as hypothe-
ses that imply interactions within the
central nervous system.

For theoretical presentation of processes
underlying approach and withdrawal mecha-
nisms the reader is referred to Schneirla
(1959).


CONCLUDING REMARKS 


The paper has dealt with problems 
of measuring the reinforcing conse- 
quences of brain stimulation. In illus- 
trating problems there is always the 
danger of not viewing the progress in 
proper perspective. Actually much of 
the work cited has been of a pioneer- 
ing nature, often extremely rich in hy- 
potheses and frequently very creative 
methodologically. Subsequent work has 
attempted precision, but often with the 
rough implements of the frontier. Con- 
clusions have gone beyond a general 
statement of possible usefulness of a 
technique to claims that this drug has 
this effect for this reason. It there- 
fore seems appropriate and important 
at this time to examine the methods 
used to reach these conclusions. 

Studies of the reinforcing conse- 
quences of stimulation are likely to 
make significant advances in our un- 
derstanding of the physiology of such 
familiar chapter headings as motiva-
tion, emotion, drive, instinct, rein-
forcement, learning, and many others.
It is most important that we travel a
road which will permit the accumula-
tion of valid conclusions which are
meaningful beyond the idiosyncratic
conditions of a particular experimental
method. The present paper has not
constructed any freeways. This has 
been an attempt to place caution signs 
where existing roads are dangerous 
and road blocks where progress is not
possible. For the theoretician who at- 
tempts to integrate recent findings in 
this area: 


PROCEED AT YOUR OWN RISK!


MEASUREMENT PROBLEMS WITH BRAIN STIMULATION 
ANALYZING THE COMPONENTS OF
CLINICAL INFERENCE


KENNETH R. HAMMOND, CAROLYN J. HURSCH, AND FREDERICK J.
University of Colorado 


The purpose of this paper is to analyze the components of clinical infer-
ence within the framework of Brunswik's lens model by means of
multiple-regression analysis, 2 parallel studies of clinical psychologists,
the performance of Ss in a quasi-clinical task, and the performance of


Two sharply contrasting attitudes 
toward the psychology of inference 
may be observed within psychology. 
On the one hand, clinical psychologists 
behave as if there were virtually no 
limit to their capacity for inference 
making. Indeed, the general assump- 
tion, only slightly tarnished, is—the 
more data on which to base the 
inference, the better the inference is 
likely to be. On the other hand, ex- 
perimental psychologists, particularly 
learning theorists, appear to believe 
that man’s capacity for inference 
making is hardly more than a brief 
step removed from none at all. Where 
the truth lies, however, may be 
learned from the inferential behavior 
of the clinician himself, for clinical
psychology provides an excellent re-
search site for the study of man as
an inferring organism.

This paper studies clinical inference
within the framework of probabilistic
functionalism, with Brunswik's lens
model (1955, 1956) as the point of
departure. The analysis is based on
the application of multiple-regression
technique to the lens model as ini-
tially set forth by Hammond (1955),
and as recently outlined in further
detail by Hursch, Hammond, and
Hursch (1964).


LENS MODEL AND MULTIPLE-
REGRESSION ANALYSIS


Fig. 1. Brunswik's lens model.

When the process of clinical infer-
ence is examined in terms of Bruns-
wik's lens model (see Fig. 1), the
application of multiple-regression
analysis is straightforward. First,
each half of the lens can be described
in terms of the value of the multiple
correlation coefficient (hereafter re-
ferred to as R_e for the environment, or
stimulus conditions, and R_s for the
subject). The R_e describes the rela-
tion between the cues and the variable
to be estimated, and the R_s describes
the relation between cues and the
subject's responses. Thus, the degree
of linear determinacy in each system 
can be calculated, and comparisons 
can be made between the environ- 
mental and organismic systems. 
Second, the specific relation be- 
tween each cue and the variable to be 
estimated (ecological validities) as 
well as the relation between the cues 
and the subject's response (utilization 
coefficients) can be measured in terms 
of correlation coefficients. Third, the
[...] correlations (more precisely, the sum
of the products of the differences be-
tween ecological validities and utiliza-
tion coefficients and the differences be-
tween the respective weights)
can also be calculated. Such a calcu-
lation provides an index of the extent
to which the subject uses each cue
relative to the validity of the cue.
Fourth, a partial correlation be- 
tween the subject's judgment and the 
variable to be estimated (with the 
linear variance of each system elim- 
inated) may be calculated. 


The relation between
achievement (correlation between the
subject's estimate and the variable
estimated) and the statistical com-
ponents described above is set forth
in the equation below (see Hursch et
al., 1964, for a full development of the
equation):

$$r_a = \frac{R_e^2 + R_s^2 - \Sigma d}{2} + C\sqrt{(1 - R_e^2)(1 - R_s^2)}$$

where

R_e = the multiple correlation be-
      tween the cues and the vari-
      able estimated

R_s = the multiple correlation be-
      tween the cues and the sub-
      ject’s judgments

Σd = the sum of the products
      (r_ei − r_si)(β_ei − β_si), where r_ei
      = the correlation between
      cue i and the variable esti-
      mated, r_si = the correlation
      between cue i and the sub-
      ject’s judgment, β_ei = the
      beta weight for the correla-
      tion between cue i and the
      variable estimated, and β_si
      = the beta weight for the
      correlation between cue i and
      the subject’s response

C = the correlation between the
      variance unaccounted for by
      the multiple correlation in the
      ecology and the variance un-
      accounted for by the multiple
      correlation in the subject's
      response system
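
As a check on how the components combine, the equation may be evaluated numerically. The sketch below is illustrative only: the function and argument names are ours, and the input values are those reported later in the text for Sophisticated Judge 5 (squared multiple correlations of .63 and .85, a Σd of .264, and a C of .32).

```python
import math


def lens_model_achievement(re_sq, rs_sq, sum_d, c):
    """Achievement r_a from the lens-model components.

    re_sq, rs_sq: squared multiple correlations for the environment and subject
    sum_d: sum of products of validity/utilization and beta-weight differences
    c: correlation between the residuals of the two systems
    """
    linear_part = (re_sq + rs_sq - sum_d) / 2
    residual_part = c * math.sqrt((1 - re_sq) * (1 - rs_sq))
    return linear_part + residual_part


r_a = lens_model_achievement(re_sq=0.63, rs_sq=0.85, sum_d=0.264, c=0.32)
print(round(r_a, 2))
```

With these inputs the result rounds to .68, which agrees with the achievement reported for that judge.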


Clinical Study I 


Within the course of comparing the 
predictive efficiency of the multiple 
regression equation with that of 
clinical judgment, Grebstein (1963) 
compared the performance of five 
Naive clinicians (graduate students 
who had completed an introductory 
course in Rorschach administration 
and scoring), five Semisophisticated 
clinicians (graduate students who had 
completed the above training and 
were also second-year Veterans Ad- 
ministration trainees), and five So- 
phisticated clinicians (professional clin- 
ical psychologists with the PhD and 5 
years of experience) whose task was to 
estimate 30 patients’ (Wechsler-Belle- 
vue) IQs using 10 cues from their 
Rorschach (Beck) psychograms. 
Grebstein (1963) concluded that 
“ability to judge IQ from Rorschach 
psychograms does not improve with 
additional clinical experience [p. 
130].” 

Irrespective of the substantive na- 
ture of Grebstein’s results, his study 
provides an excellent site for the 
examination of the heuristic value of 
the statistical components discussed 
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above. If it can be demonstrated 
that these statistical components are 
useful in this case, they should be 
generally useful. The present study, 
therefore, examines the performance 
of each of Grebstein’s groups of 
clinicians in terms of the components 
of the above equation. Before com- 
paring the performance of the three 
groups with one another, however, it 
will be worthwhile to evaluate the 
absolute level of their performance. 


Evaluation of the Performance of Greb- 
stein’s Clinicians 

In Grebstein’s (1963) study the
mean correlations of predicted IQ with
actual IQ for the Naive, Semiso- 
phisticated, and Sophisticated Judges 
were .50, .65, and .68, respectively. 
While these correlations were found 
to be significantly different from zero, 
they fall far short of 1.00. Thus 
Grebstein concludes, “Experienced
clinical psychologists can make accu-
rate estimates of intelligence from
Rorschach summaries, but there is
room for improvement [p. 132].” The
question arises, however, as to how 
much room for improvement exists. 
The following section will demonstrate 
that the limits of achievement will de- 
pend upon the relation between char- 
acteristics of the test situation and the 
characteristics of the response system 
of the clinician.

Limits of Achievement. The prin- 
cipal aim of Hursch et al. (1964) was 
to examine six different relations be- 
tween the statistical properties of the 
environment, or task, and the sta- 
tistical properties of the response 
system in order to discover the limits 
of achievement in each case under 
various conditions. Grebstein’s study 
falls under Case VI, in which it is 
shown that achievement (r_a), the
correlation between the clinician's
estimate of IQ and the patient’s IQ,
must lie between:

$$\frac{R_e^2 + R_s^2 - \Sigma(\beta_{e_i} - \beta_{s_i})(r_{e_i} - r_{s_i})}{2} - C\sqrt{(1 - R_e^2)(1 - R_s^2)} \le r_a \le \frac{R_e^2 + R_s^2 - \Sigma(\beta_{e_i} - \beta_{s_i})(r_{e_i} - r_{s_i})}{2} + C\sqrt{(1 - R_e^2)(1 - R_s^2)} \qquad (1)$$

Thus, the limits of achievement for
Sophisticated Judge 5 in Grebstein's
(1963) study are given by the follow-
ing statistical properties of the task
and his response system:

$$R_e^2 = .63, \qquad R_s^2 = .85, \qquad \Sigma(\beta_{e_i} - \beta_{s_i})(r_{e_i} - r_{s_i}) = .264$$

and since the limits of C are ±1, then
the limits of r_a are:

$$\frac{.63 + .85 - .264}{2} - \sqrt{(.37)(.15)} \le r_a \le \frac{.63 + .85 - .264}{2} + \sqrt{(.37)(.15)}$$

$$.608 - .234 \le r_a \le .608 + .234$$

$$.374 \le r_a \le .842$$


Therefore, Sophisticated Judge 5,
whose achievement, r_a, was actually
.68, would have reduced his achieve- 
ment to .37 if the residual variance of 
his response system had been perfectly 
inversely correlated with the residual 
aspects of the Rorschach-IQ system, 
and would have reached .84, had this 
correlation been positive and unity. 
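
The limits just described can be recomputed directly from the three reported quantities. The following is a minimal sketch under the assumption that the rounded values printed in the text (.63, .85, and .264) are used as inputs; the function name is ours. Because the inputs are rounded, the computed limits may differ from the printed ones in the third decimal place.

```python
import math


def achievement_limits(re_sq, rs_sq, sum_d):
    """Lower limit, midpoint, and upper limit of r_a as C runs from -1 to +1."""
    center = (re_sq + rs_sq - sum_d) / 2
    half_width = math.sqrt((1 - re_sq) * (1 - rs_sq))
    return center - half_width, center, center + half_width


lower, center, upper = achievement_limits(re_sq=0.63, rs_sq=0.85, sum_d=0.264)
print(f"{lower:.3f} {center:.3f} {upper:.3f}")
```

The midpoint is the value achievement would take if the two sets of residuals were uncorrelated, and the half-width shows how much a nonzero C could add or subtract.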

If the correlation between the two 
residual variances had been exactly 
zero, then: 

$$r_a = \frac{R_e^2 + R_s^2 - \Sigma d}{2} = \frac{.63 + .85 - .264}{2} = .61$$
TABLE 1

MULTIPLE-REGRESSION COEFFICIENTS FOR
INDIVIDUAL JUDGES AND GROUPS*

        | Naive       | Semisophisticated | Sophisticated
Median  | .905        | .951              | .919
Judge 1 | .982        | .898              | .863
Judge 2 | .871        | .951              | .928
Judge 3 | [illegible] | .955              | .930
Judge 4 | .979        | .852              | .870
Judge 5 | .905        | [illegible]       | .919

Note. Kruskal-Wallis H = [illegible]; ns.
* Based on data from Grebstein (1963).


which is not far from his actual
achievement (r_a = .68), and which is
to be expected in view of the fact that 
his C was fairly low, .32. If, then, 
Sophisticated Judge 5 was not ex- 
pected to develop a substantial value 
of C (and such an expectation implies 
a great deal), his achievement was 
very close to the maximum he could 
have achieved under the circum- 
stances. This point will be developed 
further as each component is discussed 
in detail. What is to be remembered
is that the upper limit of achievement
is not uncritically to be assumed to be
unity.


Differences between Naive, Semisophis- 
ticated, and Sophisticated Clinicians 


Tables 1, 2, and 3 show (a) that 
clinicians with varying amounts of 
training and experience did not differ 
with respect to R_s, the linearity of the
response process, (b) that they did not 
differ with respect to C, the correlation 
between the residual variance in the 
Rorschach-IQ system and residual 
variance in the subject’s response 
system, but (c) that they did differ 
with respect to Σd, the degree of
match between ecological validities 
and utilization coefficients. Each of 
these results is discussed in turn 
below. 
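
The group comparison on C can be illustrated with a Kruskal-Wallis computation. The sketch below uses the individual values of C as they can be read from Table 2 in this copy, so transcription errors are possible; the function is a plain implementation of the usual H statistic without a tie correction (none of these values are tied), and the 5.991 criterion is the .05 chi-square value for 2 degrees of freedom, used here only as a rough large-sample guide.

```python
def kruskal_wallis_h(*groups):
    """Kruskal-Wallis H for independent groups, assuming no tied values."""
    pooled = sorted(value for group in groups for value in group)
    assert len(set(pooled)) == len(pooled), "this sketch does not handle ties"
    rank = {value: position + 1 for position, value in enumerate(pooled)}
    n_total = len(pooled)
    weighted = sum(sum(rank[v] for v in g) ** 2 / len(g) for g in groups)
    return 12.0 / (n_total * (n_total + 1)) * weighted - 3 * (n_total + 1)


# Values of C for Judges 1-5 in each group, as read from Table 2.
naive = [-0.057, 0.220, 0.244, 0.030, 0.300]
semisophisticated = [0.328, 0.035, 0.179, 0.214, 0.051]
sophisticated = [0.140, 0.107, 0.322, 0.056, 0.320]

h = kruskal_wallis_h(naive, semisophisticated, sophisticated)
print(round(h, 2), h < 5.991)
```

Under these assumptions H is well below the criterion, which is in keeping with the nonsignificant result noted under the table.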


Linearity (R_s)

[...] (Table 1) indicates that they do not
differ in terms of the linearity of their
response systems. The [...]
fact for the present discussion is that
all groups are highly linear. More-
over, they are more linear than the
Rorschach-IQ system of relations
(R_e = .79). The median R_s for the
Naive clinicians is .905; for the Semi-
sophisticated clinicians, the median
R_s = .951; and for the Sophisticated
clinicians, the median R_s = .919. The
rank order correlation between lin-
earity (R_s) and achievement (r_a) is
[...]

[...] cor-
relation would be just equal to R_e, or
.79. (See Case 111/1, Formula 17,
in Hursch et al., 1964.) The mean 
achievement (r = .68) of the Sophis- 
ticated clinicians as computed by 
Grebstein (1963, p. 129), was quite 
close to this, as might be expected 
from the high linearity of the clinic- 
ians. On the other hand, the multiple- 
regression technique, in which the 
correlation between the cues and the 
estimate is exactly 1.00, produced an 
achievement r_a of only .56 on a cross-
validation sample. The difference
between the achievement of the 
clinicians and the multiple-regression 
technique raises an interesting point 
which has received considerable par- 
tisan discussion (Holt, 1958; Meehl, 
1954; Sarbin, Taft, & Bailey, 1960). 
Each procedure is discussed below in
terms of the linear process. 

Linearity and Clinical Prediction. 
Because the clinicians' linearity is ex-
tremely high, about .90, for the
moment we shall ignore this slight
departure from unity and assume
that they were indeed perfectly linear
(R_s = 1.00). And because the
achievement (r_a = .68) of the com-
posite Semisophisticated and Sophis-
ticated clinicians sufficiently approxi-
mates the maximum attainable (.79)
if they had been perfectly linear, it is
also plausible to assume that the
clinicians achieved the maximum ac-
curacy an R_s of 1.00 and an R_e of .79
would permit. If the clinicians had 
actually performed according to these 
assumptions (which are close to the 
fact), how would such performance be 
explained? 

If R_s is taken to be 1.00, then it
would have to be concluded that a 
linear, additive rule was being used. 
What would such a rule consist of? 
One answer is that the clinicians were 
either systematically applying an in- 
ductive rule based on experience, or 
were systematically applying a rule 
taken from a textbook. There is some 
evidence that the Sophisticated clini- 
cians were doing the former, and the 
Naive clinicians the latter. The 
evidence comes from an inquiry into 
the specific mismatches between eco- 
logical validities and utilization coeffi- 
cients in Grebstein's study. 

When the differences between the
utilization coefficients and ecological
validities of the three groups are
examined with respect to the 10 Ror-
schach factors under consideration,
it becomes apparent that the Naive
clinicians misused one cue, Beck's Z,²
[...] other cues. The
ecological correlation between Z and
IQ in Grebstein's sample is .18. The
mean utilization coefficient is .63 (cor-
rected) for Z in the Naive group,
however, which is much higher,
whereas the mean utilization coeffi-
cient for the Semisophisticated group is
only .38 and .27 for the Sophisti-
cated clinicians, indicating much closer
approximation to the empirical value
of .18. It appears plausible, then, that
the Naive clinicians were relying on a
textbook rule, whereas the Sophisti-
cated clinicians were depending on a
rule developed from experience.³

The highly accurate predictions
made by the experienced clinicians
would have to be explained in terms
of optimal weighting. Given a mul-
tiple R_s of 1.00, there is only one set
of weights which could be applied to
the regression equation which would
result in maximum achievement. The
high achievement indicates that the
experienced clinicians were actually
approximating this unique set of
weights very closely.

Although it might be argued that
the clinicians were aware of the rule
they were applying, and were consci-
ously applying it with precision so
that the R_s closely approached 1.00,
it strains credulity to believe that they
were also consciously applying op-
timal weights with the same precision
so that achievement approached a
maximum.⁴ Because these conditions [...]
equation, it will be recalled, provided [...]
diction against actuarial [...] more than [...]

² Beck (1946) described the relation be-
tween Z and intelligence as follows: “it (Z)
is an index of the intellectual energy as such
irrespective of the kind of intelligence that
[...] uses, something that does influence W.
[...] is a more accurate representative [...]
intelligence functioning per se. [...]
higher Z scores are always found in in-
dividuals whose work requires most difficult
and complex intellectual effort. Like W, it
[is] therefore an index of thinking power. Its
[...] capacity to [...] relations not
[...] others [p. 12].”

³ See [...] Crow, Githens,
[...]ck, Gyr, and Saunders (pp. 526-542,
[...]) for a detailed examination of differential
reliance on cues as a function of profes-
sional interest on the part of raters.

⁴ Both Todd (1954) and Hoffman (1960)
[...] their subjects could not report
[...] See also [...]
Rommetveit (1960).
TABLE 2

CORRELATION (C) BETWEEN NONLINEAR
ASPECTS OF JUDGES’ RESPONSE
SYSTEMS AND RORSCHACH-IQ
SYSTEMS FOR INDIVIDUAL
JUDGES AND GROUPS*

        | Naive | Semisophisticated | Sophisticated
Median  |  .220 | .179              | .144
Judge 1 | -.057 | .328              | .140
Judge 2 |  .220 | .035              | .107
Judge 3 |  .244 | .179              | .322
Judge 4 |  .030 | .214              | .056
Judge 5 |  .300 | .051              | .320

Note. Kruskal-Wallis H = [illegible]; ns.
* Based on data from Grebstein (1963).


predictive utility of the equation dis- 
appears when cross-validated on a 
new and, of course, different “sample” 
(cf., Sarbin, Taft, & Bailey, 1960, pp. 
247-250). If the samples are not 
random, the multiple-regression equa- 
tion is not only subject to shrinkage 
due to sampling errors, it is subject to 
systematic error as well. (If samples
are indeed random, then, of course,
the shrinkage of the multiple R can be
computed directly; see Guilford, 1956;
McNemar, 1962.)
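
The shrinkage computation mentioned above can be sketched as follows. The sketch applies one widely used correction, the Wherry-type adjustment R^2_adj = 1 - (1 - R^2)(N - 1)/(N - k - 1); that this is the formula intended by Guilford (1956) or McNemar (1962) is an assumption, and the sample sizes and number of cues below are hypothetical values chosen only for illustration.

```python
def shrunken_r_squared(r_squared, n_cases, n_predictors):
    """Wherry-type estimate of the population squared multiple correlation."""
    if n_cases - n_predictors - 1 <= 0:
        raise ValueError("need more cases than predictors plus one")
    ratio = (n_cases - 1) / (n_cases - n_predictors - 1)
    return 1.0 - (1.0 - r_squared) * ratio


# Hypothetical illustration: a multiple R of .79 obtained from 4 cues.
# The sample sizes (30 and 300) are assumptions, not figures from the article.
r_squared = 0.79 ** 2
small_sample = shrunken_r_squared(r_squared, n_cases=30, n_predictors=4)
large_sample = shrunken_r_squared(r_squared, n_cases=300, n_predictors=4)
```

Under these assumptions the estimate from the small sample shrinks more than the estimate from the large one, which is the sense in which an equation built on a single small sample has limited "experience."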
On the other hand, neither Greb- 
stein’s nor Holt's clinicians were re- 
quired to have their performance 
completely determined by the idiosyn- 
cracies of a prior sample. In effect 
then, the multiple-regression technique 
is put to an unfair test whenever it is 
calculated from a single, small sample, 
because its "experience" is thereby
severely limited. If the equation in 
Grebstein’s study had been allowed to 
develop from greater experience, how- 
ever, either as a result of exposure to 
several random samples, or through 
exposure to a very large N, it would 
in all likelihood have performed much 
better. 
Ironically, the highly linear performance
and higher r_a of the clinician
lends support to this argument. For
there was a remarkable similarity be- 
tween clinical inference and the mul- 
tiple regression model in this study. 
The clinicians’ inferential processes 
were nearly identical with the mul- 
tiple-regression procedure both in 
function and in content (see Grebstein, 
p. 129). Both procedures used es- 
sentially the same variables (M, 
F + , Z, and RC) in the same, i.e., 
linear, way. The clinicians, therefore,
were proving how good an "experienced"
linear process could be. If
there is a dispute between clinical and 
statistical prediction, it would be 
difficult in this case to discriminate 
between the contestants’ performance. 

Since the purpose of this paper is 
heuristic, rather than substantive, let 
us now assume that the clinicians’ 
responses were not completely identi- 
cal with a multiple-regression equation 
based on any sample no matter what 
size. Indeed, there is a sense in which 
such linear indeterminacy defines the 
clinical method. This lack of com- 
plete linear determinacy in the clini- 
cians’ response system suggests the 
possibility that they utilize nonlinear 
as well as linear relations in making 
their judgments—a point to which we 
now turn.


Correlation between Residual Variances 


(C)


The data from Table 2 indicate that 
there were no differences between the 
three groups of clinicians in this 
respect. 

The analysis of C in Grebstein's study has direct implications for
test construction. First note that if R_s = 1.00, there is, of course,
no residual variance in the clinician's response system, and therefore
no correlation is possible with the residual variance in the
environment. Whenever R_s < 1.00, however, the possibility exists that
the clinician can raise his achievement by correlating the residual
variance in his response system with that in the environment, provided
R_e is also < 1.00. (This possibility is not available to the
multiple-regression equation because residual variance is treated only
as random error.) The important point to be observed here is that C can
become large only if there is some systematic, nonlinear variance in
the environmental system which can be detected by the clinician.
Because the multiple-regression procedure cannot improve its
performance in this way, the magnitude of C specifies the extent to
which the clinician effectively uses those special properties which
distinguish him from the multiple-regression equation. Of course, if
the task does not provide the opportunity, he cannot use them.
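
A minimal sketch of how C might be computed is given here, under simplifying assumptions: a single cue and a straight-line fit stand in for the multiple-regression analysis, the data are invented, and the function names are hypothetical. It shows the two cases described above: a judge whose residuals track the nonlinear part of the environment, and a perfectly linear judge for whom no residual correlation can be computed.

```python
import math


def linear_residuals(cue, values):
    """Residuals of `values` after a least-squares straight-line fit on `cue`."""
    n = len(cue)
    mean_x = sum(cue) / n
    mean_y = sum(values) / n
    sxx = sum((x - mean_x) ** 2 for x in cue)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(cue, values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return [y - (intercept + slope * x) for x, y in zip(cue, values)]


def residual_correlation(cue, criterion, judgments, tol=1e-12):
    """C: correlation between the two residual series, or None if undefined."""
    res_e = linear_residuals(cue, criterion)
    res_s = linear_residuals(cue, judgments)
    ss_e = sum(r * r for r in res_e)
    ss_s = sum(r * r for r in res_s)
    if ss_e < tol or ss_s < tol:
        return None  # no residual variance on one side, so no correlation
    # Least-squares residuals have zero mean, so this ratio is their correlation.
    return sum(a * b for a, b in zip(res_e, res_s)) / math.sqrt(ss_e * ss_s)


# Invented data: the criterion has a systematic nonlinear (squared) component.
cue = [-2.0, -1.0, 0.0, 1.0, 2.0]
criterion = [x + x * x for x in cue]
nonlinear_judge = [0.5 * x + x * x for x in cue]  # picks up the squared term
linear_judge = [2.0 * x for x in cue]             # R_s = 1.00, no residuals

c_nonlinear = residual_correlation(cue, criterion, nonlinear_judge)
c_linear = residual_correlation(cue, criterion, linear_judge)
```

In this constructed case the first judge obtains C = 1.00, while for the perfectly linear judge the function returns None, mirroring the statement that no correlation is possible when R_s = 1.00.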
Since all of Grebstein's clinicians
had R_s's slightly less than unity (ranging
between .84 and .98) and since
approximately 40% of the variance in
the Rorschach-IQ system is not
accounted for by linear regression
(R_e^2 = .63), the question arises as to
the extent of the correlation between 
the nonlinear variances in the two 
systems. The values of C in Table 2 
are quite low, ranging from —.06 to 
+.39, indicating little, if any, relation 
between the residual variances. 
Whether the residual variance in the 
Rorschach-IQ system is to be con- 
sidered as error, or as systematic, non- 
linear variance remains an open 
question. The obtained values of C 
indicate, however, that if any sys- 
tematic nonlinear variance existed, it 
was not, in the main, detected and used by the clinicians in
Grebstein's study.

This result raises several questions. 
Does the Rorschach-IQ system in fact
have systematic, nonlinear variance
which is detectable? Should not this
question be answered in a definitive 
form before requiring clinicians to 
attempt to detect such relations? A 
further question is: given the demon- 
strated linear relations between the 
Rorschach cues and IQ, would sys- 
tematic, detectable nonlinear variance 
make an appreciable difference in 
achievement even if it were detected? 

The answer to the latter question 
can be found by referring to the limits 
of achievement calculated earlier for 
Sophisticated Judge 5. There we 
observed that had this Judge maintained the same degree of linearity
(R_s = .85) and the same degree of match (Σd = .264), but increased C
from .32 to 1.00, his achievement would have increased from .68 to .84.
In view of the highly remote likelihood of achieving such a perfect
correlation, the effort, first, to discover perfect nonlinear relations
in the Rorschach-IQ system, and second, to train someone to achieve
perfect recognition of these relations without reducing an already
creditable overall achievement appears to be a very poor risk. Even if
the clinician did achieve a C of 1.00, he would also have to obtain an
R_s^2 exactly equal to R_e^2 (.63), and a Σd of exactly zero in order to
obtain perfect achievement (r_a = 1.00). Perfect matching of R_e^2 and
Σd on the part of the clinician seems much less likely than perfect
(or near perfect) linearity.

These conclusions bring us to the 
fundamental implications of the above 
considerations for test construction 
itself.

Nonlinearity and Test Construction. 
Those who develop psychological tests
which require clinicians to make infer- 
ences from the nonlinear aspects of the 
cue data provided by the tests should 
bear the burden of demonstrating that 
such nonlinear relations (a) exist, (b)
are detectable by clinicians, and (c) 
are worth whatever trouble it takes to 
detect them. Moreover, test makers 
will have to take into consideration 
the fact that our scientific knowledge 
concerning the human’s ability to 
detect nonlinear relations is hardly 
more than primitive. Indeed, the 
entire question of the nature of human 
information processing is still in its 
infancy. In view of all this, one 
wonders whether the present meth- 
odology for construction of tests 
which require clinical inference has 
not outrun our knowledge of cognitive 
processes, and whether future makers 
of such tests should not include in 
their methodology considerations of 
the psychology of inference (the 
“cognitive activity of the clinician” as 
Meehl, 1960, has put it) as well as the 
psychology of personality. This calls 
for nothing more than accepting the 
operational consequences of what 
everyone has long accepted—the no- 
tion that both test and clinician are 
interacting parts of the same system 
(cf. Hammond, 1955). 

This discussion does not, of course, 
exhaust the implications of non- 
linearity for test construction, but 


TABLE 3

Σd FOR INDIVIDUAL JUDGES AND GROUPS*

            Naive    Semisophisticated    Sophisticated
Median       .700          .458               .331
Judge 1     1.903          .588               .331
Judge 2      .834          .478               .421
Judge 3      .652          .458               .509
Judge 4      .367          .276               .180
Judge 5      .700          .289               .264

Note.—Kruskal-Wallis H = 6.5; p < .05.
Naive versus Semisophisticated p = .028.
Naive versus Sophisticated p = .016.
Semisophisticated versus Sophisticated p = .210.
* Based on data from Grebstein (1963).
TABLE 4

DIFFERENCES IN r_a AMONG THREE GROUPS OF CLINICIANS
VARYING IN EXPERIENCE*

Mann-Whitney U
  Naive versus Semisophisticated             p = .028
  Naive versus Sophisticated                 .008 < p < .016
  Semisophisticated versus Sophisticated     NS
Kruskal-Wallis H = 7.215                     .02 < p < .05

* Based on data from Grebstein (1963).


that topic is too large to be explored 
fully here. We turn now to a con- 
sideration of the third component of 
the analysis, Σd.


The Degree of Match between Eco- 
logical Validities and Cue Utilization 
(Σd)


Grebstein’s three groups do not 
differ with respect to (a) the linearity 
of their response systems (R_s), or (b)
the extent to which they were able to 
detect systematic, nonlinear com- 
binations in the residual variance of 
the Rorschach-IQ system (C); the 
three groups do differ with respect to 
(c) the extent to which utilization 
coefficients match ecological validities, 
however. 

The data in Table 3 show that there were statistically significant
differences between Grebstein's groups with respect to Σd; this
component becomes progressively smaller, as it should, as the
experience of the clinician increases. Furthermore, Σd is highly
related to achievement within groups; the smaller Σd, the higher the
achievement. (Rank difference correlations for the Naive,
Semisophisticated, and Sophisticated groups are -.90, -.90, and -.72,
respectively.) Σd, therefore, clearly has promise as an important
component of clinical inference.
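
The rank difference correlation used above can be sketched as follows, using the standard formula rho = 1 - 6 ΣD^2 / (n(n^2 - 1)) for untied ranks. The five Σd and achievement values are hypothetical, chosen only so that larger Σd goes with lower achievement; they are not Grebstein's data.

```python
def ranks(values):
    """Rank from 1 (smallest) upward; assumes there are no tied values."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result


def rank_difference_correlation(xs, ys):
    """Spearman's rho for untied data: 1 - 6 * sum(D^2) / (n * (n^2 - 1))."""
    n = len(xs)
    d_squared = sum((rx - ry) ** 2 for rx, ry in zip(ranks(xs), ranks(ys)))
    return 1.0 - 6.0 * d_squared / (n * (n * n - 1))


# Hypothetical group of five judges (values invented for illustration).
sum_d_values = [1.9, 0.8, 0.6, 0.4, 0.7]
achievement_values = [-0.2, 0.3, 0.5, 0.6, 0.2]

rho = rank_difference_correlation(sum_d_values, achievement_values)
```

A strongly negative rho, as in this invented example, corresponds to the pattern reported in the text: the smaller Σd, the higher the achievement.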
The fact that the three groups
differed with respect to Σd, and that
the within-group correlations between
Σd and achievement were quite large,
led to a re-examination of Grebstein's 
conclusions that the three groups did 
not differ with respect to achievement. 
When the achievement correlations in 
each group of clinicians were analyzed 
in terms of the Mann-Whitney U test 
(the results of Bartlett's Test of 
homogeneity having indicated a para- 
metric test to be inappropriate), 
significant differences between groups 
were obtained. The results of this 
analysis (see Table 4) are consistent
with the analysis of Σd: progressive
differences in achievement are ob- 
served as experience increases, al- 
though the difference between the 
Semisophisticated and the Sophis- 
ticated clinicians is not statistically 
significant. 

The inference that achievement 
might well be increased by increasing 
the match between ecological validi- 
ties and utilization coefficients, is 
therefore supported by the data in 
Table 4 which indicate that the groups 
with smaller Σd are the groups with
greater achievement. (As pointed
out above, the specific mismatch
which led to a large Σd for the Naive
clinicians was the overdependence on
Beck's Z.) Σd, however, is a highly
complicated concept, and its relation 
to the use and misuse of cues is not 
unambiguous. 

Σd and the Weighting of Cues. The quantity Σd, while serving as an
important component of achievement, does not easily lend itself to
further analysis for two reasons. The first reason is the highly
interdependent relations of the components of the multiple-regression
analysis presented here. Changes in the value of any one of these
components can result in changes in the values of other components.
The value of Σd itself as well as its relation to r_a is dependent on
the values of the other components. The contribution of Σd to r_a,
therefore, cannot be assessed without examination of the other
components as well. Thus, our treatment of Σd, in conjunction with our
consideration of R_e and R_s, with an assumed or computed value for C,
is a valid one, whereas the consideration of Σd alone might not be.

Secondly, when Σd is computed from empirical data as in the present
analysis, one may be tempted to look at each difference product,
compare these with each other, and make statements about the relative
contribution of each cue to Σd according to the size of its difference
product. This, however, would be a hazardous procedure because the
quantity Σd is not a simple sum of the mismatches; rather, it is a
resultant of interdependent difference products (or vectors). For this
reason, it is impossible to evaluate the independent effect of any one
mismatch on the resultant Σd (cf. Fink, 1961; Hoffman, 1960; Ward,
1962).
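
The interdependence of the difference products can be illustrated with a small sketch. It assumes that Σd is the sum over cues of (r_ei - r_si)(β_ei - β_si), where r denotes a cue's correlation with the criterion or with the judgment and β the corresponding standardized regression weight; this reading of "difference product" is an assumption here, as are the two-cue setup, the cue intercorrelation of .50, and all numerical values.

```python
def correlations_from_betas(betas, cue_intercorrelation):
    """For two standardized cues: r_1 = b_1 + rho * b_2, r_2 = b_2 + rho * b_1."""
    b1, b2 = betas
    rho = cue_intercorrelation
    return [b1 + rho * b2, b2 + rho * b1]


def difference_products(betas_env, betas_judge, cue_intercorrelation):
    """Per-cue products (r_e - r_s) * (beta_e - beta_s); their sum is taken as Σd."""
    r_env = correlations_from_betas(betas_env, cue_intercorrelation)
    r_judge = correlations_from_betas(betas_judge, cue_intercorrelation)
    return [
        (re - rs) * (be - bs)
        for re, rs, be, bs in zip(r_env, r_judge, betas_env, betas_judge)
    ]


RHO = 0.5               # hypothetical cue intercorrelation
BETAS_ENV = [0.5, 0.2]  # hypothetical environmental weights

# Judge mismatched on both cues.
products_before = difference_products(BETAS_ENV, [0.1, 0.6], RHO)
# Same judge after correcting only the weight on the first cue.
products_after = difference_products(BETAS_ENV, [0.5, 0.6], RHO)

total_before = sum(products_before)
total_after = sum(products_after)
```

In this constructed case, correcting the first weight removes the first difference product but enlarges the second, leaving the total unchanged. The size of one product therefore says little about that cue's independent contribution, which is the hazard the text describes.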


Matching versus Linearity 


Earlier we discussed the implications of perfect linearity on the part
of the clinician. There we found that achievement, r_a, could be as
large as R_e. Under conditions of perfect matching of utilization
coefficients with ecological validities (more precisely, conditions
under which Σd equals zero), r_a equals R_e^2, if we assume C = 0 (see
Case II, Equation 13, in Hursch et al., 1963).

Thus, if one of Grebstein's clinicians had a Σd which equalled zero
(and C also = 0), his achievement, r_a, would be exactly equal to the
square of the multiple correlation in the environment, .63. Since
R_e^2 < R_e, perfect matching (Σd = 0) always results in a lower limit
of achievement than the limit provided by perfect linearity (assuming
C = 0). This is a point worth remembering when comparing clinicians
with the linear multiple-regression equation.
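
The two limits just compared can be checked numerically with a minimal sketch. It assumes the general form of the lens model equation attributed to Hursch et al., r_a = (R_e^2 + R_s^2 - Σd)/2 + C sqrt((1 - R_e^2)(1 - R_s^2)). The value Σd = (1 - R_e)^2 used for the perfectly linear judge with optimal weights is derived here under that assumption and is not quoted from the article; the function name is hypothetical.

```python
import math


def achievement(r_e_sq, r_s_sq, sum_d, c=0.0):
    """Assumed lens model equation: linear part plus C times the residual term."""
    linear_part = (r_e_sq + r_s_sq - sum_d) / 2.0
    residual_part = c * math.sqrt((1.0 - r_e_sq) * (1.0 - r_s_sq))
    return linear_part + residual_part


R_E_SQ = 0.63            # squared environmental multiple correlation (from the text)
R_E = math.sqrt(R_E_SQ)

# Perfect matching: Σd = 0 and R_s^2 = R_e^2, with C = 0.
matching_limit = achievement(R_E_SQ, R_E_SQ, sum_d=0.0)

# Perfect linearity with optimal weights: R_s = 1 and, by the derivation
# assumed above, Σd = (1 - R_e)^2, again with C = 0.
linearity_limit = achievement(R_E_SQ, 1.0, sum_d=(1.0 - R_E) ** 2)
```

Under these assumptions the matching limit equals R_e^2 (.63) and the linearity limit equals R_e, so the former is necessarily the lower of the two whenever R_e is less than unity.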

Consider now Semisophisticated Judge 4 who had a high achievement
correlation of .60, a very small Σd (.275) and a relatively low R_s
(.852). Consider also Naive Judge 1 who had the lowest achievement
correlation (-.196), a very large Σd (1.90), and a very high R_s
(.952). According to our earlier remarks, we conclude that because
Semisophisticated Judge 4 has a smaller R_s, he is less rule bound than
Naive Judge 1. Judge 4, however, has a higher r_a in spite of this
because his match is better, that is, his Σd is smaller. Should Naive
Judge 1 become less rule bound or should he try to decrease Σd? The
choice between changing R_s or Σd bears implications for cognitive
theory related to the above discussion concerning linearity and the
clinician.


Matching and Cognitive Theory 


Earlier it was asserted that increasing
linearity implied increasing rule-bound
behavior. Decreasing such
behavior, then, implies moving in the
direction of probabilistic behavior.
Brunswik's (1956, pp. 89-99) theoretical
distinction between thinking
and perception parallels the above 
distinction between rule-bound and 
probabilistic behavior. According to 
Brunswik, at one pole of the cognitive 
continuum lies “certainty-geared” de- 
ductive thinking, and at the other 
pole lies “uncertainty-geared” intui- 
tive perception. Deductive thinking 
implies rule-bound cognition. And 
probabilistic, intuitive perception 
should involve the matching of ecological validities of perceptual
cues with their respective utilization coefficients (1956, p. 141).
Between these two poles lies quasi-rationality (Brunswik, 1954;
Hammond, 1961; Heider, 1958), an area in which both types of cognition
participate. How a subject distributes or employs these two types
characterizes his inferential mode. The effectiveness of his
inferential mode is determined by the properties of the environment.
Within the Brunswikian framework, then, to urge Naive Judge 1 to give
up his "rule-bound" response system and to increase his match implies
that he is to be urged to give up an analytical approach and to adopt
an intuitive one. And to urge Semisophisticated Judge 4 to increase the
rule-like quality of his response system without decreasing his match
is to urge him to become more analytical without giving up his
successful intuitive performance. Our knowledge of cognition, of
course, does not permit us more than conjectures as to whether such
changes are within the realm of possibility. A study by Rappoport
(1963), however, indicates that pairs of subjects learning under an
intuitive set engage in less interpersonal conflict than pairs of
subjects learning under an analytical set. This result suggests that
the response systems of rule-bound subjects are harder to change than
those of intuitive subjects, and implies that it would be easier to
change the more intuitive clinician (here Semisophisticated Judge 4)
than the more analytical one (Naive Judge 1).
We turn now to a brief examination of the performance of 10
experienced clinicians with respect to all components of performance.


Clinical Study II


An examination of Todd's (1954) 10 experienced clinicians shows that
these clinicians (a) were also highly linear in their response systems
(median R_s of .93),* (b) had a mean Σd about the same as Grebstein's
(.044 versus .055), and (c) had C values approximately equal to those
of Grebstein's clinicians, although seven of Todd's clinicians had
negative Cs, which reduced their achievement (see Hursch et al., 1964,
Table 5, for complete details). As in the case of Grebstein's
clinicians, the rank order correlation between Σd and achievement was
substantial, being equal to -.76, while the rank-order correlation
between R_s and r_a was negligible (.22). In brief, Grebstein's results
confirm those obtained by Todd. One factor contributing to this
similarity of results may be that the multiple correlation between
Rorschach factors (Klopfer technique) and IQ in Todd's study was .77,
and in Grebstein's study (Beck technique) was .79.


A QUASI-CLINICAL STUDY 


Newton (1963) carried out what 
might be called a study of quasi- 
clinical inference. His subjects were 
99 college sophomores who estimated 
Freshman grade averages (N = 53) 
from four cues (IQ, high school rank, 
a College Board Score, and a per- 
sonality rating by the high school 
principal). These subjects were also
highly linear (mean R_s over five conditions
approximately .90), had a mean
Σd identical with Todd's clinicians
(Σd = .05) and had negligible values
of C.

Newton's subjects were first asked 
to make their estimates of grade 
average on the basis of their campus 
experience. Four different types of 
feedback were then employed, each 

* The high R_s's obtained with Grebstein's
clinicians are, of course, subject to shrinkage.
The high R_s's obtained in Todd's study, however,
were maintained on cross validation.




TABLE 5

RANK ORDER CORRELATIONS BETWEEN Σd AND r_a BEFORE (B) AND AFTER (A)
DIFFERENT CONDITIONS OF FEEDBACK*

[Table values are illegible in the source.]

* Data from Newton (1965).


type providing progressively more 
information. (The fifth condition 
required the subjects to form an 
explicit inference rule.) The data 
of Table 5 show the rank-order correlations between Σd and
achievement, r_a, in each condition. Although these correlations are
not as high as those obtained with the psychologists described above,
they indicate substantially the same trend, a definite relation
between Σd and r_a.


Implications of Variation in Feedback 
for Improving Achievement 


Newton's study illustrates a further 
point concerning clinical inference 
discussed by Holt (1958, p. 5). In 
disputing the current methodology 
employed in comparing the efficiency 
of clinical and actuarial methods of 
prediction, Holt argued that the 
clinician’s performance should be ex- 
posed to the same sort of self-correc- 
tion employed in actuarial methods. 
That is, just as the statistician ex- 
amines the validity (or weights) of 
various items or tests, the clinician 
should be allowed to examine the 
empirical validity of the variables he 
is depending upon for his inferences. 

Although Newton's study was car- 
ried out for different purposes, his 
results bear directly on Holt's point.
His subjects estimated Freshman 
grade averages on the basis of four 
cues, as indicated above. Each sub- 
ject was then assigned to one of five 
different conditions in which different 
forms of feedback were provided (see 
Fig. 2). The results indicate that 
those subjects who were told what the 
ecological validities in the sample 
were, and what their utilization coeffi- 
cients were, improved (not substan- 
tially, but to a statistically significant 
degree) in their second set of 53 trials. 

The fact that college sophomores improved their achievement after one
exposure to the ecological validities and to their own manner of
utilizing cues is remarkable. Whether it opens the door to new training
methods for clinicians (with respect to certain inferential tasks, at
least) remains to be seen. Holt's position, in any event, is confirmed
by Newton's results. If subjects are provided with the appropriate
information concerning cues and their responses, they can improve
their performance. Lee and Tucker (1962) report a similar result with
respect to a multiple-probability learning experiment.*

* A recently completed experiment by Todd and Hammond (1964) confirms
the finding that statistical feedback of this sort is superior to
traditional outcome feedback in improving the performance of subjects
in a multiple-cue probability learning task.

FIG. 2. Different types of feedback provided by Newton (1963).
[Figure not reproduced.]


A MULTIPLE-CUE PROBABILITY
LEARNING EXPERIMENT


As a part of a larger study, Rappoport asked subjects to estimate the
values of numbers on the back of 3 X 5 inch cards on the basis of the
sizes of three geometrical figures on the front of the cards which had
probabilistic relations with the number on the back of the cards. The
validities were .48, .54, and .81, and the R_e for the task was .99.
Aside from the high R_e, this study is analogous to the task facing
the [...] probabilistic relations [...]. It is different, however, in
that it is a learning task, and thus similar to the studies by
Smedslund (1955), Summers (1962), and Lee and Tucker (1962).

The results from Rappoport's study are taken from 15 randomly selected
subjects' performance on the first 50 trials, although the asymptotic
level had not been reached for all subjects: (a) the median R_s is .61,
again indicating a fairly high degree of linearity on the part of the
subjects; (b) negligible amounts of C are present, indicating no
correlation between nonlinear aspects of the stimulus situation and
the subjects' response system, which is as it should be since the R_e
controlling the stimulus arrangement is .99, thereby leaving virtually
zero variance remaining in the environment; (c) mean Σd = .649,
somewhat larger than that observed in the above-mentioned studies. (It
must be remembered that these subjects were still in an early stage of
learning.) As in the previous studies, we find a substantial
correlation between Σd and r_a; at the end of 50 trials it is -.86.
Because the multiple R controlling the stimulus arrangements was
virtually 1.00, it is noteworthy that the rank order correlation
between Σd and R_s is -.70.

These results make clear the implications of the equation

    r_a = (R_e^2 + R_s^2 - Σd) / 2

for when R_e^2 is virtually 1.00, as it is here, achievement can be
increased only by maximizing R_s (developing a completely linear
response system) and minimizing Σd (matching ecological validities
with utilization coefficients). It is for this reason that Rappoport's
study is the only one of those discussed here in which there is a high
correlation between R_s and r_a. In the other studies, the departure of
R_e from unity was sufficiently large to reduce substantially the
correlation between R_s and r_a. In these studies, subjects could
increase their achievement either by raising R_s or reducing Σd; here
it was incumbent upon them to do both. This illustrates the fact that
the nature of the environment [...] of the response system. In a
fundamental sense, it [...] Brunswik's (1957) argument that the
environment deserves as much consideration as the organism in the
effort to understand achievement.


IMPLICATIONS FOR TEST
CONSTRUCTION


Cronbach (1956) pronounced "global
assessment" to be a failure. He
then proceeded to put his finger on 
the inferential process as the trouble 
spot: “Assessment encounters trouble 
because it involves hazardous infer- 
ences.” And the difficulty is charged 
specifically to the fact that assessors 
“attempt a maximum inference from 
tests [p. 173].” Cronbach (1956) saw 
the prime hazard to be that, 


Assessors have been foolhardy enough to 
venture predictions of behavior in unanalyzed 
situations, using tests whose construct inter- 
pretations are dubious and personality theory 
which has more gaps than solid matter [p. 
173]. 
While we are in general agreement
with Cronbach’s pronouncement, our 
analysis of inference leads us to argue 
that the very process of test construction 
itself must take into account the 
psychology of inference if the test 
data are to be used as a basis for 
inference. This means that experi- 
mental psychology will have to be a 
partner in the enterprise. Use of the 
lens model, although not mandatory, 
will expedite matters. 


Lens Model, Experimental Psychology, 
and Clinical Inference 


The lens model has an environmental
side (to which we applied R_e)
and an organismic side (to which we
applied R_s). As far as the problem of
clinical inference is concerned, each 
side defines a set of problems and 
perhaps a division of labor. The 
environmental side involves the prob- 
lems of test construction, a problem 
to which psychometricians and psy- 
chodiagnosticians devote their ener- 
gies with little concern for the 
cognitive process of the clinicians who 
must infer the psychological state of 
the patient from the test data. The 
organismic side involves the cognitive 
problems of learning, thinking, and 
perception, problems to which experi- 
mental psychologists devote their 
energies with little concern for the 
problem of inferring covert states from 
a complex composed of
interdependent, uncertain data such 
as those provided by psychological 
tests. If the lens model does nothing 
else, it should make plain the fact 
that clinical test construction and the 
experimental psychology of inference 
provide a research domain to which 
both can contribute, and from which 
both can profit. 

Those experimental psychologists 

who are interested in the psychology 
of inference will be likely to differ with
clinical psychologists, however, as to 
the next step. The former are likely 
to seek answers by simplifying the 
inferential tasks studied here; the 
latter by increasing their complexity. 
Because our prejudice is with the 
latter, we will consider some of the 
steps that can be taken toward in- 
creasing complexity. First, however, 
we must note that requests for in- 
creased complexity can be directed 
toward both sides with equal force. 
If experimentalists are to be asked to 
construct and to study problems of 
greater stimulus complexity, clini- 
cians and test makers can be asked to 
construct tests in ways which are 
sustained by more than tradition. 

Although the clinical studies re- 
viewed above involve more complex 
stimulus arrangements than those 
customarily employed in experimental 
psychology, they are also clearly 
vulnerable to the criticism of over- 
simplification. Estimating the pa- 
tient’s location on a single variable 
such as IQ is a task which is a priori 
highly conducive to a linear response 
system. We are confident that linear 
response systems will appear in future 
studies of this type, and that such 
studies will find small differences 
between the cognitive processes of the 
clinician, or any human subject, 
and the multiple-regression equation. 
Such similarity, if found to be a fact, 
would not be unimportant, and a great 
deal needs to be learned about its 
implications for learning, thinking, 
and perception as well as its practical 
implications for clinical prediction. 
But it is even more important to 
discover how human subjects deal
with more complex systems. Can the 
lens model method of analysis be of 
use with these problems? 

The answer is yes, and at least two
directions of research are indicated.
The first is concerned with the environmental
or test side of the lens
model, the second deals with the
problem of theoretically oriented inferences.


Cue-Criterion Systems 


Little if anything is known about 
the structure or dynamics of the 
inferential situations with which clini- 
cians are now confronted in their 
diagnostic work. Although a clinician 
may discover from the literature that 
a given test discriminated one diagnostic
group from another (p < .05),
and may even learn that certain items 
discriminated between these groups 
(p < .05), this information is of little 
use to him. For such information 
does not indicate what the ecological 
validities are between cues and vari- 
able(s) to be estimated, what the 
reliabilities of the cues are, how much 
redundant information (cue inter- 
correlation) is present, how much 
determinacy there is in the cue- 
criterion variable system as a whole, 
or how much and which part of the 
system is dependably linear and how 
much and which part is nonlinear. 
And, of course, the clinician knows 
little if anything about his own 
cognitive processes. One result of the 
failure of clinical research to focus on 
the structure and dynamics of the 
clinical task is that research hardly 
affects clinical practice. The connec- 
tion between research and function is 
too remote. 

If clinical research were to discover 
what the cue-criterion structure is for 
the inference tasks clinicians must 
cope with, one next step would be to 
undertake experimental research in 
the laboratory in order to learn how 
(and to what extent) human cognitive 
processes can cope with such multiple-probability
cue structures. Such an
interchange should lead to research 
which would rationalize the structure 
of the clinicians’ inferential task. 
And such an interchange would enable 
the experimentalist to study more 
complex cognitive tasks than he has 
chosen to undertake so far.

Although it is possible with the aid 
of the computer to construct labora- 
tory analogs of present clinical infer- 
ence tasks, it would be more to the 
point to start fresh. Present infer- 
ential tasks were not developed with 
specific consideration given to the 
cognitive capacities and processes of 
the humans required to make infer- 
ences from the data. Therefore, it can 
safely be predicted that simulation of 
current inferential tasks would merely 
uncover their relative deficiencies.
And since enthusiasm for current 
procedures seems to be on the wane in 
any event, the discovery of further 
deficiencies seems beside the point. 
A fresh start would require that con- 
struction of cue-criterion systems and 
the analysis of cognitive processes be 
carried out in relation to each other. 
This suggestion involves nothing more 
than what would be required of any 
newly constructed stimulus display, 
for example, a cockpit design for
dials and gauges (tests) from which a 
pilot must make inferences. 

But reasons for a fresh start stem 
not only from the question of the 
inferential capacity of the one in 
relation to the structure and dynamics 
of the inferential task, but also from 
questions concerning the role of theory 
in clinical (or other) inferences. 


Theoretically Oriented Inferences 


As Meehl (1960) has pointed out, a 
purely “dispositional,” inductive ap- 
proach is not the only way to conceive 
of the clinicians’ task.

An alternative, more exciting (and more con-
genial to the clinician's self-concept) is to view
the clinician's cognitive activity as aiming at
the assessment of hypothetical inner states
which belong to the domain of theo-
retical entities [p. 20].

In order to apply the lens model to 
this situation, which we agree is at 
once more interesting and more repre- 
sentative of clinical inferences, a 
double lens model is necessary (see 
Fig. 3). On the left-hand side of the 
double lens model a distal trait, or 
theoretical entity is included which is 
related to proximal traits (Meehl's 
dispositional terms), which, in turn, 
are probabilistically related to directly 
observable cues (A_1 . . . A_k) and (B_1
. . . B_k), as in the single lens model
(cf. Meehl, 1960; also Sarbin, Taft, &
Bailey, 1960, on probabilism).

On the right-hand side of Figure 3 
are the inferences made by the clini- 
cian. As in the single lens model, the 
correlation between the inferred T_1 and
the actual T_1 indicates the accuracy of
the clinician's inference about T_1; the
correlation between specific cues
A_1 . . . A_k and the inferred T_1
indicates the clinician's dependence on
these cues. This dependence may be 
compared with the ecological validities 
of the cues (but see above remarks 
concerning weighting of cues). 
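
As an illustration of the indices just described, the following Python sketch computes, for a hypothetical set of cases, the accuracy (achievement) of a clinician's inferences and, for each cue, both its ecological validity and the clinician's dependence on it. This is an editorial sketch under stated assumptions, not the procedure used in the study: the data, the cue names `A1` and `A2`, and the function names `pearson` and `lens_indices` are all invented for the example.

```python
from math import sqrt


def pearson(x, y):
    """Product-moment correlation between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def lens_indices(cues, criterion, judgments):
    """Return achievement plus, per cue, ecological validity and utilization."""
    # Accuracy: correlation between the inferred and the actual trait values.
    achievement = pearson(judgments, criterion)
    per_cue = {}
    for name, values in cues.items():
        per_cue[name] = {
            # How well the cue predicts the actual trait.
            "ecological_validity": pearson(values, criterion),
            # How strongly the clinician's inference depends on the cue.
            "utilization": pearson(values, judgments),
        }
    return achievement, per_cue


# Hypothetical data for six cases.
cues = {
    "A1": [1, 2, 3, 4, 5, 6],
    "A2": [2, 1, 4, 3, 6, 5],
}
criterion = [2, 3, 3, 5, 6, 6]   # actual trait values
judgments = [1, 2, 3, 4, 5, 6]   # this hypothetical clinician follows A1 exactly

achievement, per_cue = lens_indices(cues, criterion, judgments)
print(f"achievement: {achievement:.3f}")
for name, indices in per_cue.items():
    print(name, {key: round(value, 3) for key, value in indices.items()})
```

Comparing the two columns for each cue shows where the clinician's dependence on a cue departs from its ecological validity. In this contrived case the clinician relies wholly on one cue, so achievement can be no higher than that cue's validity.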

Of course, in order for an analogy 
to be drawn between this model and 
clinical inference, the clinician must 
have a theory as to how Y might be
inferred from variables T_1 and T_2.
Moreover, the theory must be explicit
as to how the impalpable T_1 and T_2
are to be inferred from the palpable 
but probabilistic cue data of each 
single lens model. 

Fig. 3. The double lens model. (Width of line indicates hypothetical
degree of correlation.) [Labels recoverable from the figure: inferred
“proximal” trait; inferred distal trait; clinician side.]

The double lens model provides the
means for undertaking the research
which is adequate to the complexity
of theoretically oriented inferences.
As theory encompasses more cues,
more proximal traits, and more distal
traits, then, of course, the double lens
model will have to be expanded to a 
triple lens model, and so on. If this 
appears to involve more complexity 
than is desirable, all we need to 
remember is that no more complexity 
need be put into the test or environ- 
mental side than the organismic side 
can cope with. At the same time, we 
need to put as much complexity in the 
environmental side as the organism 
can cope with if we ever hope to 
understand the full complexity of the 
psychology of inference. 

The lens model, then, provides a 
framework for research which places 
the inferring organism and the infer- 
ential situation in their proper relation 
to one another. Furthermore, it 
makes possible research which permits 
a degree of complexity characteristic 
of the human inference situation while 
providing a precise analysis of both 
the situation and the organism's 
inferential process.


MOTION AND EMOTION: THE ROLE OF PROPRIOCEPTION
IN THE PHYSIOLOGY AND PATHOLOGY OF THE EMOTIONS ¹


Proprioceptive discharges contribute to the physiological processes
underlying the emotions in two ways: (a) by the setting of the
hypothalamic balance which, other conditions being equal, is determined
by the posture of the body through the total quantity of proprioceptive
impulses impinging on the posterior hypothalamus per unit of time;
(b) through facial contraction patterns which lead to afferent
discharges via the hypothalamic-cortical system and interact with
cutaneous facial impulses in the cortex. It is pointed out that states
of abnormal emotional tension are alleviated in various “relaxation”
therapies through reducing proprioceptive impulses which impinge on
the posterior hypothalamus and maintain the cerebral cortex in an
abnormal state of excitation.


He who always wears the mask of a
friendly man must at last gain a power
over friendliness of disposition, without
which the expression itself of
friendliness is not to be gained, and
finally friendliness of disposition gains
the ascendancy over him—he is
benevolent.

—FRIEDRICH NIETZSCHE (1876)


There are several lines of evidence for
relating the emotional state and the
activity of the striated muscles. The
psychogalvanic reflex, admittedly a
good indicator of emotional tension, is,
according to Freeman (1948), “the
best general index of the amount of
total neuromuscular activity [p. 13].”³
Total metabolism and activity of the
striated muscles are greatly increased
during mental work which arouses
emotional reactions, but these
physiological effects are absent if the same
task is carried out without emotional
excitement (von Eiff, 1952, 1957).
Studies on psychoneurotics show that
“progressive relaxation” leads pari
passu to a decrease in muscle tone and
in emotional disturbances (Jacobson,
1938). That anxiety states are
associated with increased tension of the
skeletal muscles as disclosed by the
electromyogram has been pointed out
by numerous authors.⁴ These data
challenge the neurophysiologist to
elucidate the underlying mechanisms
which seem to be of importance for
the therapy of emotional disturbances
and for the understanding of the
physiology of the emotions. Obviously, the
emotions lead not only to autonomic
but also to somatic discharges. They
induce manifold changes in the tonic
and phasic activity of the striated
muscles. The former appear as alterations
in posture, the latter as contractions
of the facial muscles. Both are
characteristic for each emotion, and will be
discussed from the physiological point
of view following an outline of
hypothalamic functions.

3 See also Malmo and Davis (1956).

4 See Balshan (1962) for the literature.


CHARACTERISTICS OF HYPOTHALAMIC 
FUNCTIONS 


Investigations of autonomic functions
by Hess (1949) and others (Gellhorn,
1957b; Ranson, 1940) have clearly
shown that the anterior hypothalamus
primarily influences parasympathetic
activity whereas the posterior
hypothalamus regulates sympathetic
processes. The physiological
significance of these processes becomes
obvious in the behavior of the
unanesthetized animal in which the
autonomic changes are associated with
characteristic alterations in the state
of the somatic nervous system as indi- 
cated by the activity of the striated 
muscles. Sleep occurring spontane- 
ously or resulting from excitation of 
the anterior hypothalamus is associated 
with increased parasympathetic activity 
(Ban, Masai, Sakai, & Kurotsu, 1951) 
and relaxation of the muscles whereas 
in wakefulness or on stimulation of 
the posterior hypothalamus sympa- 
thetic and somatic activity is increased 
(Hess, 1949; Ranson, 1934). Simi- 
larly, on exposure to cold, the sympa- 
thetic and the somatic systems (the 
state of the latter indicated by shiver- 
ing) are activated whereas on expo- 
sure to heat, parasympathetic dis- 
charges associated with relaxation of 
the striated muscles prevail (Ranson, 
1940). 

In contrast to earlier concepts, ac- 
cording to which the posterior hypo- 
thalamus was thought to be responsible 
solely for the motor expression of the 
emotions (Masserman, 1943), it is 
now believed that its excitation elicits 
processes which underlie also the sub- 
jective phenomena of emotion. The 
reasons are that on hypothalamic stim- 
ulation the unanesthetized cat shows a 
well-directed attack during, and
behavioral aftereffects following,
stimulation like an emotionally [excited]
animal (Hess, 1949; Wasman &
Flynn, 1962). Moreover, stimulation
of the posterior hypothalamus [may]
serve as an unconditioned stimulus
(US) since a sensory stimulus
reinforced adequately by a
hypothalamic stimulus elicits an
emotional response similar to that
produced by the US (Affani, Marchiafava,
& Zernicki, 1962; Nakao, 1958).
These data suggest that hypothalamic
excitation produces not only
autonomic-somatic “downward” discharges
but also “upward” discharges to the
[cortex]. Experiments involving
stimulation of the motor area of
the cortex showed that hypothalamic
stimulation facilitates cortically
induced movements (Murphy &
Gellhorn, 1945b) and increases pyramidal
discharges (Gellhorn, 1957b). Under
the influence of stimulation of the
sympathetic division of the hypothalamus,
“Dial” waves disappear from the
cortex and are replaced by more fi[...]
potentials signifying desynchronization
while cortical strychnine spikes
increase in frequency. These effects are
bilateral and reversible (Murphy &
Gellhorn, 1945a).

Magoun, Moruzzi, and their
collaborators (Magoun, 1954, 1958)⁵
discovered that stimulation of the
reticular formation in the brain stem
(including the hypothalamus, subthalamus,
and the reticular thalamic nuclei) produces
a similar diffuse excitation of the
cortex and, in the unanesthetized animal,
arousal, whereas lesions in the m[...]
part of the brain stem induce [...]
in spite of the integrity of the [long]
ascending afferent system (Lindsley,
Bowden, & Magoun, 1949; Lindsley,
Schreiner, Knowles, & Magoun, 1950).⁶
It is believed that tonic afferent
impulses from the reticular formation
keep the hypothalamus in a reactive
state.⁷

The hypothalamus exerts a tonic
(excitatory) influence on the cortex
since injection of a barbiturate or
coagulation of the posterior hypothalamus
lessens the state of cortical
excitability (Gellhorn, 1957a; Gellhorn &
Ballin, 1959; Kennard, 1943; Koella
& Gellhorn, 1954; Obrador, 1942).
On testing phasic reflexes, it is found
that with increasing degrees of
excitability of the posterior hypothalamus
(induced by intrahypothalamic
injection of minute quantities of metrazol
or strychnine) reflexes involving the
sympathetic nervous system increase
in intensity, whereas coagulation of the
posterior hypothalamus or local
injection of barbiturates exerts the opposite
effects (Gellhorn, 1957b; Gellhorn,
Nakao, & Redgate, 1956; Gellhorn &
Redgate, 1955). Similarly, the degree
of hypothalamic-cortical discharges
following direct or reflex stimulation
parallels the state of excitability of the
posterior hypothalamus (Bernhaut,
Gellhorn, & Rasmussen, 1953;
Gellhorn, 1957a; Gellhorn & Ballin, 1959).
These investigations suggest that
sympathetic discharges occurring
spontaneously or in response to reflex
stimulation reflect not only the state of
hypothalamic excitability, but also the
intensity of the hypothalamic-cortical
discharges.⁸

Whereas the ergotropic syndrome
consisting of increased sympathetic
[...] hypothalamus, the trophotropic
syndrome is evoked by stimulation of the
anterior [hypothalamus] (Ban et al.,
1951; Hess, 1949). [...]
(Nauta, 1946) resulting from the
release of the posterior hypothalamus,
whereas lesions in the latter cause
somnolence (Ranson, 1934) suggesting
dominance of the anterior hypothalamus.

5 See also Gellhorn and Loofbourrow
(1963), and Rossi and Zanchetti (1957).

6 The diffuse cortical excitation [...]
is abolished by small doses of
barbiturates which [...] interfere with the
conduction of the specific afferent systems
to the sensory projection areas (French,
Verzeano, & Magoun, 1953; Gellhorn, 1953).

7 See Gellhorn and Loofbourrow (1963)
concerning important differences between
reticular formation and hypothalamus.

8 For hypothalamic-limbic relation and
further details, see Gellhorn and
Loofbourrow (1963).

9 See Gellhorn and Loofbourrow (1963)
concerning other cerebral structures which
induce ergotropic and trophotropic effects.

If through direct stimulation or
appropriate reflex stimuli the
hypothalamic balance is altered, changes in
hypothalamic reactivity occur which are
due to reciprocal inhibition. Thus,
[...] effect is produced by the electrical
stimulation of the anterior hypothalamus
(Andersson & Persson, 1957). For
our problem it is of particular interest
that these stimuli alter the electro- 
corticogram and the proprioceptive dis- 
charges via the gamma system (von 
Euler & Söderberg, 1957) at the same 
time: The cortical potentials become 
grouped, their frequency is slowed, 
and their amplitude increased (as in 
sleep) while the proprioceptive im- 
pulses decline in frequency. Appar- 
ently the inhibition of peripheral sym- 
pathetic and somatic discharges is 
paralleled by an inhibition of the dif- 
fuse discharges from the posterior hy- 
pothalamus and the reticular formation 
to the cortex. Finally, the temperature 
threshold of the anterior hypothalamus 
to produce vasodilator action was 
found to be increased during nocicep- 
tion, which stimulates the posterior 
hypothalamus, but decreased during 
stroking of the back (von Euler & 
Söderberg, 1958), which seems to
activate the anterior hypothalamus. 
The feeling of comfort in warm en- 
vironment, seemingly displayed also by 
the cat on stroking the back, is believed 
to be the result of a shift in the hypo- 
thalamic balance to the parasympa- 
thetic side, whereas the opposite change 
in balance resulting in an increased 
dominance of the ergotropic system on 
exposure to cold or nociceptive stimuli 
appears to be related to the feeling of 
unpleasantness and tension.¹⁰


AFFERENT IMPULSES AND 
HYPOTHALAMIC BALANCE 


Koch showed in 1932 (Gellhorn & 
Loofbourrow, 1963) in dogs with ex- 
teriorized carotid sinuses that an in- 
creased intrasinusal pressure led to a 
decrease in muscle tone, lessened so- 
matic excitability, and sleep. On the 
other hand, when the pressure in the
carotid sinus was lowered, the general
excitability and activity of the animal
were greatly increased.¹¹ Moreover,
the knee jerk was found to be inhibited
with increasing intrasinusal pressure
(Spychala, 1932).

10 Concerning the relation of these
phenomena to withdrawal and thigmotactic
reflexes, see Ebbecke (1959).

In the light of present knowledge 
these experiments suggest that varia- 
tions in intrasinusal pressure alter the 
ergotropic system and the state of the 
cerebral cortex through the upward 
and downward discharges from the 
posterior hypothalamus and reticular 
formation. This interpretation is based 
on the fact that changes produced in 
the sino-aortic pressure by mechanical 
means (Bonvallet, Dell, & Hiebel, 
1954) or by drugs (Nakao, Ballin, & 
Gellhorn, 1956) alter cerebral cortical 
potentials in the expected manner pro- 
vided that the baroreceptors are intact 
(Gellhorn, 1959; Nakao et al., 1956). 
Furthermore, the rise in sino-aortic 
pressure known to cause increased dis- 
charges from the baroreceptors (Hey- 
mans & Neil, 1958) causes an increased 
reactivity of the anterior and a de- 
creased responsiveness of the posterior 
hypothalamus, whereas the opposite 
changes accompany a fall in the pres- 
sure of the carotid sinus (Gellhorn, 
1957b, 1960; Redgate & Gellhorn, 
1955).

Sensory stimuli (acoustic < proprio-
ceptive < nociceptive) lead to a diffuse
excitation of the cerebral cortex, move-
ments, and sympathetic discharges
(Bernhaut et al., 1953). The
parallelism between hypothalamic and
diffuse cortical excitation suggests
that the sensory stimuli induce a hypo-
thalamic-cortical discharge. The fail-
ure of nociceptive stimuli to evoke a
diffuse cortical excitation in cats with
bilateral lesions in the posterior hypo-
thalamus (Koella & Gellhorn, 1954)
supports this interpretation. Cold and
stress in anticipation of nociceptive
stimuli likewise increase the activity
of the ergotropic system.

11 Increase in sino-aortic pressure was
shown to decrease experimentally induced
convulsions, and vice versa (Gellhorn,
Yesinick, Kessler, & Hailman, 1942).

On the other hand, stroking the back 
elicits parasympathetic effects and syn- 
chronized cortical potentials while the 
muscles relax and the gamma dis- 
charges diminish (von Euler & Söder-
berg, 1958). Stroking also produces
muscular relaxation and cessation of
crying even in infants without cerebral
hemispheres (Edinger & Fischer,
1913) and inhibits cortically induced 
movements in anesthetized animals 
(Bubnoff & Heidenhain, 1881). It is 
suggested that these effects are due to 
the excitation of the anterior hypo- 
thalamus with reciprocal inhibition of 
the posterior division leading to less- 
ened hypothalamic-cortical discharges. 
It seems to follow from the work 
cited thus far that an increase in the 
state of activity of the posterior hypo- 
thalamus, regardless of whether it is 
induced by direct or reflex stimuli or 
whether it results from a shift in the 
intrahypothalamic balance, leads to a 
diffuse cortical excitation whereas in- 
creased discharges from the anterior 
hypothalamus are associated with a 
reduction in the state of cortical exci- 
tation. If the striated muscles are not 
curarized, muscle tone and activity 
(including proprioceptive discharges) 
are enhanced on stimulation of the 
posterior hypothalamus and inhibited 
in response to the excitation of the 
anterior hypothalamus. Other condi- 
tions being equal, the muscle spindles 
increase their rate of discharge with 
increasing activation of the gamma 
(small nerve fiber) system (Kuffler,
Hunt, & Quilliam, 1951). This acti-
vation is easily evoked by stimulation
of the reticular formation (Granit,
1955) and hypothalamus (Granit &
Kaada, 1952). Bearing in mind that
proprioceptive impulses are effective
stimulants of the posterior hypothala-
mus, it is concluded that the excitation 
of the sympathetic division of the hy- 
pothalamus in emotion leads to move- 
ments reinforced by a positive feed- 
back mechanism. 

Since hypothalamic excitation results
in a diffuse cortical excitation in
completely curarized animals and also
in the encéphale isolé, the cortical
changes are obviously not simply the
result of increased proprioceptive
discharges. [...] such discharges
contribute, via the posterior
hypothalamus, to cortical excitation.
A further proof for this interpretation
lies in the following experiments in
which the influence of reduced
proprioceptive discharges on the state
of activity of hypothalamus and cortex
was studied.


THE INFLUENCE OF BLOCKING PRO-
PRIOCEPTIVE DISCHARGES ON THE
STATE OF THE HYPOTHALAMIC
SYSTEM


Cats, prepared under ether and local 
anesthesia (or after intravenous ad- 
ministration of Pentothal) were used 
several hours later. The posterior
hypothalamus was stimulated and the ef-
fect of curare-like drugs on this stimula- 
tion was recorded on the blood pressure 
and the nictitating membrane (n.m.). 
It was found regularly that Intocos- 
trin (administered intramuscularly) in 
doses which do not alter blood pres- 
sure reduces the sympathetic effects 
of hypothalamic stimulation: The rise 
in blood pressure and the contraction 
of the n.m. are lessened. Since the 
action of Intocostrin is in general too 
prolonged to demonstrate the reversi- 
bility of this action, additional experi- 
ments were performed with Anectine 
(administered intravenously). A simi-
lar but reversible reduction in sympa-
thetic hypothalamic responsiveness was
recorded (Gellhorn, 1958a, 1958b).
In view of the fact that previous 
work had shown that changes in the 
excitability of the posterior hypothala- 
mus lead to alterations in sympathetic 
reactivity as well as to corresponding 
changes in the intensity of the hypo- 
thalamic-cortical discharges, it was ex- 
pected that these drugs which elimi- 
nate proprioceptive impulses would 
lessen the excitability of the cerebral 
cortex. It was found, indeed, that the 
small frequent cortical potentials sig- 
nifying asynchrony and excitation were 
replaced by synchronous, grouped po- 
tentials (as in sleep) after the admin- 
istration of Intocostrin or Anectine. 
The simultaneous recording of the hy- 
pothalamic potentials disclosed similar 
changes. The frequency analysis of 
these potentials showed that after Into- 
costrin the amplitude of the integrated 
potentials was greatly increased for the 
low frequencies, an effect which is 
typical for the reduction in excitability 
and increased synchrony. Finally, it 
was found that the activation of the 
hypothalamic-cortical system through 
reflexes was lessened by Intocostrin. 
Thus the action of nociceptive stimuli 
to produce an asynchrony of the po- 
tentials of hypothalamus and cortex 
was reduced in duration and intensity. 
These experimental results were con- 
firmed by Hodes (1962) who also 
produced similar effects with Flaxedil. 
This drug, which maintains cortical 
synchrony and a sleeplike state be- 
haviorally (slit pupils!) for hours, 
failed to alter the EEG when admin- 
istered intra-arterially. As in the ex- 
periments with Intocostrin, it is there-
fore the lack of proprioceptive impulses
impinging on the central nervous sys-
tem (due to the blocking of the neuro-
muscular junction) and not a direct
action on the brain which is responsible
for the behavioral effects and the action
on the cortical potentials of these
drugs. As Flaxedil wears off, the
cortical potentials become asynchro- 
nous, the pupils dilate, and spontaneous 
movements occur: The reappearance of 
proprioceptive impulses induces an 
alert state through the activation of 
the reticular formation and the hypo- 
thalamic-cortical system. 

In the normal organism there is a 
balance between an activating system, 
which produces cortical asynchrony 
and increases sympathetic discharges, 
and a recruiting system, which causes 
grouped (synchronous) potentials and 
parasympathetic discharges to prevail
(Tokizane, Kawakami, & Gellhorn, 
1957a, 1957b). The weakening of the 
activating system may be accomplished 
by reducing or eliminating the pro- 
prioceptive impulses. Under these 
conditions synchrony is produced by 
subthreshold concentration of barbitu- 
rates (Hodes, 1962) or by a sub- 
threshold shock applied to the recruit- 
ing system, e.g., via the caudate
nucleus.¹² This shift in cerebral balance
appears as relaxation, tendency to
sleep, and lessened responsiveness to
emotional excitation and pain.¹³ It
seems to account for the success of
various relaxation therapies (Jacobson,
1938; Kretschmer, 1949; Schultz &
Luthe, 1959) in psychoneuroses and
allied conditions.

12 The drug Parpanit, which reduces
muscle tone in the spinal animal (Fleisch &
Baud, 1948), lowers the recruiting threshold
of the caudate nucleus (Tokizane,
Kawakami, & Gellhorn, 1957).

13 With respect to pain, it should be added
that the perception of nociceptive stimuli and
the reaction to them require more than the
integrity of specific afferent pathways. This
is illustrated by the relief from severe pain
by frontal lobotomy (Watts & Freeman,
1948), the lessened reaction to pain in
animals with lesions in the midbrain reticular
formation (Melzack, 1961) and in dogs
reared in isolation (Melzack & Scott, 1957).
Pain is, apparently, the result of an
interaction between sensory impulse and
emotional experience. The reduction in the
excitability of the posterior hypothalamus
lessens emotional responsiveness regardless
of whether this effect is due to a loss or
reduction of proprioceptive impulses or
the absence of reinforcing stimuli which
impinge upon the hypothalamus from the
cortex in general (Murphy & Gellhorn,
1945) and the frontal lobe in particular
(Ward & McCulloch, 1947).

Different settings of the hypothalamic
balance may be produced by different
types and intensities of afferent
stimulation. Considering the latter
first, it is apparent that the loss in
muscle tone which accompanies sadness
and similar moods (Klaesi, 1953)
is associated with characteristic
postures which are diametrically opposed
to those seen in a happy mood. The
low intensity of proprioceptive
impulses prevailing in a state of sadness
would seem to contribute to the shift
in hypothalamic balance to the
parasympathetic side and to its
maintenance, and a corresponding statement
would apply to the role of increased
proprioceptive discharges and the state
of sympathetic hypothalamic dominance
in some forms of the happy state.
Although a specific stimulus, mostly
in the form of symbols (words seen
or heard), appears to be the direct
cause of mood or emotion, the setting
of the hypothalamic balance through
the total quantity of proprioceptive
impulses impinging on the hypothalamus
per unit of time is of considerable
importance. If on receiving the news
of a great loss, one would make up
one’s mind to strut back and forth
with chest expanded, this posture
would interfere with the development
of a sad mood appropriate to the
occasion. Conversely, sitting with hunched
shoulders as if one had just heard of
a tragedy would hardly allow one to
[enjoy listen]ing to a great Mozart
quartet. This close relation between
bodily posture and mood, recognized in
the German language by the word
Haltung, which means external carriage
(posture) as well as the inner attitude,
permitted Kretschmer (1949) to make
the terse statement: “Die innere
Haltung ist von der Aussenhaltung
induzierbar und umgekehrt.”¹⁴

Our interpretation of the relation 
between posture, mood, and emotional 
responsiveness is supported by experi- 
ments involving hypnosis. In numer-
ous tests it has been impossible to 
induce a certain feeling which is un- 
related to the directly suggested motor 
attitude. If, for instance, the posture 
accompanying “triumph” is suggested 
and “locked” in hypnosis, a depressive 
mood cannot be brought about unless 
the postural setting is changed (Pas- 
quarelli & Bull, 1951). 

These experiments and considera- 
tions suggest that the mood depends 
to an important degree on posture, or 
physiologically speaking, on the quan- 
tity of proprioceptive impulses which 
by their action on the posterior hypo- 
thalamus influence the hypothalamic 
balance. Apparently not only states 
of muscle tension seen in the neurotic 
but also physiological variations in the 
degree of proprioceptive discharges 
have an important influence on the 
state of “hypothalamic tuning” and, 
thereby, on emotional responsiveness.¹⁵
Needless to say, many other factors
such as age and experience not yet
amenable to further analysis are
responsible for producing the numerous
shades of mood known to us and
expressed by appropriate verbal
symbols (Bollnow, 1956).

14 The inner attitude may be induced
through the external posture, and vice versa.

15 At the present state of our knowledge,
it seems to be unwise to extend these
speculations further except to point out that the
happy as well as the tense mood is associated
with increased proprioceptive discharges.
Conversely, a loss in muscle tone occurs in
such widely different moods as those of
sadness and postprandial happiness. It is
not unlikely, however, that not only the
facial expressions but also the patterns of
activity of the skeletal muscles are different
in these different emotional states.

Although the hypothalamic balance 
influences the general type of hypo- 
thalamic-cortical discharge which will 
appear in response to stimulation, it 
seems highly probable that specific 
emotions require, in addition, a great 
variety of stimuli. They may consist 
of numerous and complex events, but 
also relatively simple stimulus situa- 
tions may play a significant role. 


FACIAL MOVEMENTS AND EMOTIONS


Since Darwin (1934) and Duchenne 
(1949), the role of the movements 
expressing emotions has been studied 
intensively. The theory of the emo- 
tions of James and Lange based on 
visceral symptoms was rejected by 
Cannon (1927, 1931) partly on the 
grounds that these processes, due to 
their slowness and lack of differentia- 
tion could not account for the great 
variety of human emotions. This ar- 
gument would hardly hold for the 
expressive movements of the face. In 
a recent review Tomkins (1962) em- 
phasizes the following facts: (a) the 
great density and early development 
of receptor-effector units in the face; 
(b) the dominance of the face in tac- 
tual experiments on children; (c) the 
lesser habituation to startle of the mus- 
cles of the face than that of other mus- 
cles; (d) the absence of fascia in facial 
muscles which makes it possible to 
contract small bundles of these muscles 
through the extensive branches of the 
facial nerves and thereby to increase 
the variety of contraction patterns. 

From the neurophysiological point of
view, it seems probable that in the
development of an emotion in the social
setting the following events play an
important role:

1. The hypothalamic discharge
which, initiated by the sight of a [...]
or the sound of laughter, the threat of
an intravenous injection or the sight
of blood, etc., leads to alterations in
autonomic balance, muscle tone, and
cortical excitation.

2. The specific motor patterns of the
facial muscles which are similar to
those resulting from stimulation of the
hypothalamus (Sem-Jacobsen &
Torkildsen, 1960) or limbic brain
(MacLean, 1955, 1960) and fundamentally
different from cortically initiated
movements. The former are preserved
when, on account of capsular
lesions, willed movements of the lower
face are abolished (Martin, 1950) and
are present also in the newborn in
which, due to fetal disturbances, the
cerebral hemispheres are lacking
(Edinger & Fischer, 1913; Jung &
Hassler, 1960).¹⁶

3. The proprioceptive and cutaneous
impulses elicited by tonic and phasic
changes in the facial muscles.

Sensations as well as stimuli of
higher order involving symbols and
memory processes arouse emotions.
Murphy and Gellhorn’s work (1945a,
b), showing that strychninization of
a cortical site located in the sensory
projection or association areas leads
to the appearance of strychnine spikes
in the hypothalamus, illustrates, at
least in principle, the underlying
neurophysiological mechanisms.¹⁷
[Excita]tion of the hypothalamus causes
upward and downward discharges, [...]
proprioceptive discharges [...]
from the contraction of any skeletal
muscles are enhanced via the gamma
system when the brain stem is aroused
in emotion.

16 [...] spinal fibers.

17 See also Gellhorn (1957).

Although the activation of the hypo- 
thalamic-cortical system through pro- 
prioceptive impulses is well established 
(Bernhaut et al., 1953), no specific 
experiments have been performed on 
the relation of the facial muscles to 
the hypothalamus.18 In view of their
involvement in emotion and their con- 
traction in typical emotional patterns 
on hypothalamic stimulation, it is to 
be expected that afferent impulses aris- 
ing from the face during emotional 
expression contribute to hypothalamic 
excitation and hypothalamic-cortical 
discharges. The great density of the 
cutaneous receptors in the face and the 
considerable variety of the patterns of 
contraction of the facial muscles sug- 
gest that the resulting patterns of 
neocortical excitation and hypothala- 
mic-cortical discharges will match in 
diversity that of the emotional expres- 
sions. 

The interaction at the cortical level 
between impulses originating during 
emotional expressions in tactile cu- 
taneous end organs of the face and 
transmitted through specific afferent 
systems to the sensorimotor cortex and 
the hypothalamic-cortical discharges 
originating in proprioceptive discharges 
from the facial and skeletal muscles 
has not yet been investigated. It is
not unlikely, however, that work on 
the interaction between hypothalamic 
and sensory impulses involving optic 
and acoustic stimuli has furnished re- 
sults which are applicable to our prob- 
lem. These studies (Gellhorn, 1954; 
Gellhorn, Koella, & Ballin, 1954, 


18 It would be of great interest to deter- 
mine whether distinct patterns of hypo-
thalamic excitation could be aroused through
afferent stimulation of various facial mus- 
cles, singly and in combinations character- 
istic for emotional expression.
1955) have shown that direct stimu-
lation of the hypothalamus or activa-
tion of this structure via nociceptive
reflexes leads to a marked enhance-
ment of the effect of either optic or
acoustic stimuli on the cortex. Thus,
direct or reflexly induced hypothalamic 
excitation which does not seem to alter 
the electrocorticogram increases the 
frequency and/or amplitude of the op- 
tically induced potentials. Often mul- 
tiple responses appear in response to 
the sensory stimuli during hypothala- 
mic excitation, while single potentials 
prevail in the control test (without 
hypothalamic involvement). More- 
over, the duration of the enhanced 
response is greatly increased (pro-
longed afterdischarges) and even cor-
tical areas which do not respond to
the sensory stimulus by itself show 
marked reactions under the influence 
of hypothalamic facilitation. The mag- 
nitude of these effects is quantitatively 
related to the intensity or frequency of 
hypothalamic stimulation, and is par- 
ticularly well demonstrated when a 
small area of the sensory cortex is 


strychninized.19


muscles. 
of the proprioceptive facial discharges 
would probably undergo quantitative and 
qualitative changes depending on the in- 
tensity and spatial distribution of the exci- 
tation of the cutaneous area and its relation 
to the facial muscles. The diversity of the 
proprioceptive facial patterns impinging on 
the hypothalamic system in different emo-
tions would thus be greatly increased.

Although the fast conducting muscle af-
ferents (Group I) do not project to the
cortex (Mountcastle, Covian, & Harrison,
1952), there is little doubt that movements 
or muscle tension resulting in excitation of 
Pacinian corpuscles (McIntyre, 1962), and
joint receptors excite the motor cortex (Gay 
& Gellhorn, 1949; Malis, Pribram, & Kru-
ger, 1953) and modify its reactivity (Gell-
horn & Hyde, 1953).
This interaction is thought to under-
lie perception, and behavioral observa-
tions support this interpretation (Gell- 
horn, 1957). Thus a cat which does 
not react to a certain optical or acoustic 
stimulus will follow these stimuli dur- 
ing a mild stimulation of the posterior 
hypothalamus which by itself results 
only in pupillary dilation and contrac-
tion of the nictitating membranes.20
But the application of these findings 
goes further. Experiments involving 
stimulation of the limbic brain have 
shown that there is a gradual transi- 
tion from arousal, during which the 
animal reacts to environmental stimuli, 
to an emotional state disclosed by 
signs of fear and anger (Chapman, 
Schroeder, Geyer, & Brazier, 1954; 
Heath, Monroe, & Mickle, 1955; Huns- 
perger, 1956). It seems not improba- 
ble, therefore, that the sensory-hypo- 
thalamic interaction in the neocortex 
accounts also for the qualitative changes 
in sensations and perceptions during 
emotional excitation. The fact that 
this facilitative action of hypothalamic- 
cortical discharges is not confined to 
the sensory projection areas and to the 
motor cortex (Murphy & Gellhorn, 
1945) but extends to the association 
areas as well (Gellhorn, 1954) sug- 
gests that it may play a role in re- 
calling past events. It should also be 
remembered that memory processes as 
well as perceptions are facilitated in 
the emotional state (McGinnis, 1949; 
Miller, 1950; Rapaport, 1942). In 
the light of this discussion, facial pro-
prioceptive and cutaneous impulses
seem to play an important role in facili-
tating the complex interactions between
brain stem and limbic brain and neocortex
which occur in emotion, and to
contribute to the variety of cortical


20 See also Thompson (1963) concerning
the loss of visual discriminative behavior in
rats with posterior hypothalamic lesions.
patterns of excitation which underlie
specific emotions.


THE SIGNIFICANCE OF LOSS OF
FACIAL EXPRESSION 


The hypothesis which links proprio- 
ceptive discharges in general and pro- 
prioceptive and cutaneous impulses 
from the facial area in particular to 
perception and emotion receives sup- 
port from unexpected quarters. Clini-
cal psychiatric observations have shown
that in paranoid schizophrenia loss
of facial expression and changes in
muscle tone are associated with the 
complaint of the patient that the per- 
sons in his environment behave like 
marionettes and have masks—as if the 
loss in emotional expressiveness were 
associated with a disturbance in per- 
ception (von Ditfurth, 1961). This 
suggestion is corroborated by the find- 
ing that in these conditions—in con- 
trast to states of depression—those 
phenothiazine derivatives are therapeu- 
tically effective in which the autonomic 
(tranquilizing) action is small but the 
extrapyramidal diencephalic stimula- 
tion considerable. The latter leads to 
symptoms of Parkinsonism in man and 
to catatonia in the experimental animal 
(Taeschler & Cerletti, 1959). As von 
Ditfurth (1961) points out, these drugs 
seem to modify the relations existing 
between environment and emotional
expression, and determine thereby
whether a “normal” or a “paranoid
world” results.

Since the emotions from their very 
beginning—the smile of the infant in- 
duced by the mother’s smile in the 
favorable setting of postprandial mus- 
cular relaxation—evolve in a certain 
social milieu, it is perhaps not surpris-
ing that the physiological mechanisms
underlying the emotions are of
social significance. It remains to be
seen whether the fundamental role
which von Ditfurth attributes to the
expression of the emotions for the
maintenance of normal interpersonal 
relations is justified in man in whom 
communication is chiefly attained 
through symbolic speech. Neverthe- 
less, it is of interest to call attention 
to the fact that caudate stimulation 
may change social behavior in mon- 
keys although “the motor effect may 
seem negligible to the observer [Del- 
gado, 1962, p. 221].” Since the extra- 
pyramidal system regulates expressive 
movements (Jung & Hassler, 1960; 
Kelly, Beaton, & Magoun, 1946; Ma-
goun, Atlas, Ingersoll, & Ranson, 1937;
Martin, 1950), a change in the latter 
may result from caudate stimulation 
and may account for the alterations in 
social behavior. The experimental ex- 
ploration of this area may bear fruit 
for physiology, psychology, and psy- 
chiatry, and even contribute to an un- 
derstanding of social relations. 

It has been shown elsewhere (Gell- 
horn & Loofbourrow, 1963) that the 
hypothalamic balance plays a central 
role in the neurophysiological mecha- 
nisms underlying the emotional proc- 
ess. Peripheral stimuli as well as ex- 
citation of the hypothalamus or of the 
limbic brain, to mention only a few 
factors, alter this balance and lead to 
repercussions in somatically and auto-
nomically innervated peripheral struc-
tures as well as in the cerebral cortex. 
Obviously, the emotions are determined 
by many factors and only in special 
conditions may one factor become of 
critical significance. Although emo- 
tions are facilitated through appro- 
priate posture and facial expression, a 
trained actor knows that he may evoke 
emotions in the audience without ex- 
periencing them. In stressing the mod- 
ulating rather than the causative role 
of proprioceptive and cutaneous im- 
pulses for the emotions, their practical 
significance is hardly lessened. The
educational and therapeutic value of
the control of the expressive move-
ments (including that of the tone of 
the skeletal muscles) lies in the fact 
that they may be used to trigger or 
to inhibit the emotions by the employ- 
ment of relatively simple physiological 
procedures.


SUMMARY 


In view of the fact that various emo- 
tions are accompanied by widely dif- 
ferent degrees of tone in the skeletal 
muscles and characteristic facial pat- 
terns, the role of these tonic and phasic 
changes in the activity of the somatic 
system for the physiology and pathol- 
ogy of the emotions is discussed. Pro- 
prioceptive and tactile impulses are 
conveyed via the specific afferent sys- 
tems to the cerebellum and the sen- 
sorimotor cortex. In addition, collat- 
erals from the specific afferent systems 
reach the reticular formation and the 
hypothalamus and account for the dif- 
fuse excitation of the cortex chiefly via 
the thalamic reticular system. 

Physiological experiments show: 


1. A characteristic association of 
somatic and autonomic processes inas- 
much as sympathetic discharges and 
muscle tone undergo parallel changes 
under physiological conditions, whereas 
states of increased parasympathetic ac- 
tivity are accompanied by a loss of 
muscle tone. 

2. Increased sympathetic and so- 
matic reactivity results from direct 
stimulation of the sympathetic division 
of the hypothalamus or from reflex 
excitation. The latter is most effective 
when nociceptive or proprioceptive 
stimuli are applied or when the pres- 
sure in the carotid sinus is diminished. 

3. Increased parasympathetic and 
decreased somatic reactivity result 
from stimulation of the anterior hypo- 
thalamus and also from an increased 
pressure in the carotid sinus.

4. Experimental evidence suggests
that the gamma system of small motor
fibers is activated from the posterior 
and inhibited from the anterior hypo- 
thalamus. It contributes to a further 
increase in proprioceptive impulses im- 
pinging on the posterior hypothalamus 
(positive feedback) when the sympa- 
thetic division of the hypothalamus is 
excited, whereas the opposite proc- 
esses are evoked on stimulation of the 
anterior hypothalamus. 

5. Excitation of the posterior hypo- 
thalamus, directly or reflexly, increases 
the diffuse excitation of the cerebral 
cortex, the “upward discharge” paral-
leling in intensity the sympathetic-
somatic “downward discharge.” Ex- 
citation of the anterior hypothalamus 
leads to a reduction in cortical excita- 
tion (synchronization of cortical poten- 
tials). 

6. Reciprocal relations exist be- 
tween anterior and posterior hypothala- 
mus. This statement is valid for the 
hypothalamic-cortical discharges which 
determine the state of cortical excita- 
tion as well as for the downward
discharges.


The application of these principles 
to the study of proprioception in rela- 
tion to the hypothalamus shows that 
increased proprioceptive discharges in- 
duced by passive movements increase 
sympathetic discharges and, via the 
hypothalamus, the state of alertness 
(desynchronization of cortical poten- 
tials). Moreover, a reduction or in- 
hibition of proprioceptive impulses 
through blocking of the neuromuscular 
junction reduces sympathetic and in- 
creases parasympathetic discharges, 
and also diminishes the state of cortical 
excitation: Synchronous potentials and 
behavior indicate that sleep has been 
induced. These experiments are be- 
lieved to represent a physiological 
model of the mechanism by which
abnormal states of emotional tension
are alleviated through various forms
of “relaxation” therapy. 

It is assumed that proprioceptive 
discharges elicited by the muscles and 
the associated excitation of cutaneous 
sense organs in the skin of the face 
play an important role in the develop- 
ment of the emotions and a subsidiary 
role in their reinforcement after they 
have been established.

Two processes are distinguished: 


1. The setting of the hypothalamic 
balance which, other conditions being 
equal, is determined by the posture of 
the body through the total quantity of 
proprioceptive impulses impinging on 
the posterior hypothalamus per unit 
of time. 

2. The facial contraction patterns 
which are characteristic for the expres-
sion of particular emotions. These con-
tractions set up complex proprioceptive
patterns which arouse the diffuse hy- 
pothalamic-cortical system while the 
sensorimotor area in the cortex is 
excited via the specific afferent system 
from tactile and proprioceptive facial 
receptors. The mode of interaction 
between the two systems established 
earlier in Gellhorn’s studies on the 
role of the hypothalamus in perception 
is believed to involve the sensorimotor 
and the association areas and to result 
in a variety of patterns of cortical ex- 
citation which qualitatively depend on 
the configuration of the activated facial 
muscles and quantitatively on the in- 
tensity of the hypothalamic-cortical 
discharge.


DEVELOPMENTAL CHANGES IN PROBLEM-SOLVING STRATEGIES


An analysis of the performance of Ss at a number of age levels,
from 3 yr. to 20 yr., in a problem-solving task. Dependent variables
include terminal level of correct response, rate of rise to terminal 
level of response, patterns of response, and performance as a function 
of a preceding correct or incorrect response. A U-shaped relation- 
ship between age and terminal level of correct response is reported, 
while age and simple patterns of response bear an inverted U rela- 
tionship to one another. Younger Ss show a more rapid rise 
to terminal level of response than do older Ss. In addition, dif- 
ferential effects of reinforcement and nonreinforcement are noted 
among the age groups, and differential changes occur in these effects 
as the task progresses. Data from a number of other studies are 
discussed in relation to these findings, and an attempt is made to 
consolidate various sources of evidence to provide support for the 
mechanisms which are assumed to underlie certain of the develop- 
mental changes reported. Differential growth of the ability to gen- 
erate hypotheses and employ strategies and the ability to process the 
information Ss gain from their own responding is suggested to
account for the major developmental differences reported.


The psychological literature on prob-
lem solving and related topics such as
concept formation is vast and appears
to be growing exponentially. Attempts
to adequately review this literature
(e.g., Chown, 1959; Duncan, 1959)
have been hampered by the noticeable
absence of systematic research pro-
grams which attempt to isolate impor-
tant variables influencing this type of
behavior. Even more scarce are de-
velopmental analyses of problem solv-
ing and related topics. Most studies
which include children as a part of
their sample use only a limited age
range, and the resulting comparisons 
across ages are sketchy and incomplete 
(cf. Russell, 1956). There are some 
notable exceptions, however. Piaget, 
in a number of works (e.g., Piaget, 
1937; Piaget, 1945; Inhelder & Piaget,
1958), has provided modern develop- 
mental psychology with invaluable de- 
scriptions and speculations as to the 
nature of intellectual development from 
early childhood to adulthood. Excel- 
lent reviews of his work may be found 
in Hunt (1962) and Flavell (1963). 
The work of the Kendlers (e.g.,
Kendler & Kendler, 1962; Kendler, 
1963) represents a point of view to- 
ward developmental research on prob- 
lem solving considerably different from 
that of Piaget. The problem situa- 
tions which Piaget presents to his 
subjects generally take the form of 
laboratory versions of real-life situa- 
tions. The Kendlers, on the other 
hand, prefer to use less complex situa-
tions in an effort to discover basic
mechanisms which they presume will 
also be important determiners of be- 
havior in more complex, real-life 
situations. 

While this paper is not a defense of 
either of these approaches, the data to 
be presented here were gathered from 
a task which fits the Kendler approach 
much more closely than it does that 
of Piaget. A considerable number of 
studies by other investigators have di- 
rect relevance to many of the findings 
which will be reported here. For the 
most part, however, these studies make 
use of a variety of tasks and reinforce- 
ment conditions, and utilize only a 
limited age range. As a consequence, 
any attempt to review them at the 
beginning of this paper might make the 
aggregate appear chaotic and unre- 
lated. For this reason, the results and 
conclusions of the most pertinent of 
these studies will be discussed later in 
the context of the results of the present 
experiment to which they seem most 
closely related. An attempt will be 
made to show that there is considerable 
agreement among the results of these 
diverse studies, and that they give sup- 
port, for the most part, to the develop- 
mental data and its interpretation 
which will be presented in this paper. 


METHOD 
Subjects 


The subjects were 290 children, adoles- 
cents, and adults ranging in age from 3 
years to 20 years. Preschool children were 
obtained from private nursery schools, and 
both elementary and secondary school chil- 
dren were obtained from private and pub- 
lic schools. As far as could be ascertained, 
these children were from neighborhoods 
consisting of middle- and upper middle-class 
families. The data to be reported repre-
sent, for the most part, the performance of
subjects who had been a part of a number 
of earlier studies, all of which contained 
groups receiving the same task and instruc- 
tions (e.g., Gruen & Weir, 1964; Stevenson
& Weir, 1959; Stevenson & Weir, 1963).
The adult data were collected from college 
students as a part of an introductory psy- 
chology course. 


Apparatus 


The apparatus has been described in de- 
tail previously (Stevenson & Zigler, 1958). 
It consisted of a yellow panel containing a 
horizontal row of three knobs. Above the 
knobs was a signal light and below the 
knobs was a delivery hole for marbles. The 
marbles fell into an enclosed plastic con- 
tainer. 


Procedure 


Subjects were run individually. Each 
subject was seated in front of the ap- 
paratus and was told that he was to play a 
game, and that when the signal light went 
on he was to press one of the three knobs. 
He was also told that if he was correct,
a marble would fall into the enclosed plastic 
container. The subjects were not allowed 
to handle or keep the marbles, but the 
younger children were told that if they 
won enough marbles they would win a prize 
which they could keep. Since appropriate 
prizes are difficult to find for adults and 
adolescents, this part of the instructions 
was omitted for these groups. It might 
be argued that the resulting differences in 
incentive may have affected performance, 
but so far as the experimenters were able 
to tell, motivation at all ages was high. 
All subjects were instructed that the ob- 
ject of the game was for them to win as 
many marbles as possible. The subjects 
were given a total of 80 trials each. 

For all subjects, only one of the three 
knobs ever paid off, and that only part of 
the time. Two reinforcement percentages 
for the payoff knob were used, 33% and 
66%. Choices of the other two knobs 
never resulted in the delivery of a marble. 
For each of the two percentage conditions, 
the subject was reinforced a certain per- 
centage of the times he actually chose the 
payoff knob. For example, in the 33% 
condition reinforcement was delivered on 
33% of the subject’s choices of the payoff
knob, rather than on 33% of the total trials
in the task. For each subject, one of
the three knobs (either left, middle, or
right) was designated as the correct knob.
Equal numbers of subjects were tested with
each of the three knobs. In each per-
centage condition, subjects from a number
of age levels were included. Table 1 shows
the mean ages of all the age groups in the 
two percentage conditions, as well as the 
number of subjects within each of these 
age groups. 
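The reinforcement contingency described above can be made concrete with a short simulation. The following Python sketch is an illustration only and is not part of the original procedure: all function names are hypothetical, and reinforcement is drawn independently on each choice of the payoff knob, which approximates but does not reproduce the randomized schedule actually used.

```python
import random


def run_task(choose, payoff_knob=0, p_reinforce=0.33, n_trials=80, seed=0):
    """Simulate one subject on the three-knob task (illustrative assumption).

    choose(history) returns a knob index (0, 1, or 2) given the list of
    (choice, rewarded) pairs so far. Only payoff_knob ever pays off, and it
    pays with probability p_reinforce each time it is chosen.
    """
    rng = random.Random(seed)
    history = []
    for _ in range(n_trials):
        choice = choose(history)
        rewarded = choice == payoff_knob and rng.random() < p_reinforce
        history.append((choice, rewarded))
    return history


def always_left(history):
    """Hypothetical strategy: press knob 0 on every trial."""
    return 0


def cycle_knobs(history):
    """Hypothetical strategy: press the knobs in the order 0, 1, 2, 0, ..."""
    return len(history) % 3


def marbles(history):
    """Number of marbles delivered over the session."""
    return sum(1 for _, rewarded in history if rewarded)


def correct_responses(history, payoff_knob=0):
    """Choices of the payoff knob, whether or not a marble was delivered."""
    return sum(1 for choice, _ in history if choice == payoff_knob)
```

Because reinforcement depends on the subject's own choices of the payoff knob, a subject who rarely presses that knob receives few marbles regardless of the nominal percentage.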

This is a deceptive task. It appears quite 
simple on the surface, but in fact has no 
solution—at least not the kind of solution 
that most subjects seem to expect (Steven- 
son & Weir, 1963; Weir, 1962). It will 
not, for example, allow any solution which 
will yield 100% reinforcement. It will not 
even allow accurate prediction of when a 
marble will be delivered and when it will 
not, since the partial reinforcement sched- 
ule is randomized. Because the task does 
not provide a solution acceptable to most 
subjects, it makes possible examination of 
problem-solving strategies over a long series 
of trials. The solution which will yield
the greatest number of marbles is 100% 
choice of the payoff knob. This behavior, 
which will be referred to as maximization, 
will be seen to occur as a predominant 
strategy only in certain age groups. 
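The advantage of maximization follows from simple arithmetic. The Python sketch below is a hedged illustration, assuming that each choice of the payoff knob is reinforced independently with probability equal to the nominal percentage; the function name is hypothetical and does not come from the original report.

```python
def expected_marbles(p_choose_payoff, p_reinforce, n_trials=80):
    """Expected marbles when the payoff knob is chosen on a fixed proportion
    of trials and each such choice is reinforced with probability p_reinforce
    (an assumed independence model, not the original schedule)."""
    return n_trials * p_choose_payoff * p_reinforce


if __name__ == "__main__":
    for p_reinforce in (0.33, 0.66):
        maximizing = expected_marbles(1.0, p_reinforce)
        indifferent = expected_marbles(1 / 3, p_reinforce)
        print(f"{p_reinforce:.0%} condition: maximizing {maximizing:.1f}, "
              f"equal choice among knobs {indifferent:.1f}")
```

Under these assumptions a subject who always chooses the payoff knob can expect about 26 marbles in the 33% condition and about 53 in the 66% condition, three times the yield of a subject who distributes choices equally over the three knobs.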


RESULTS AND DISCUSSION
Terminal Level of Response 


The per cent choice of the payoff 
knob during trials 61-80 as a function 
of age and reinforcement percentage 
condition is shown in Figure 1. (For 
the sake of brevity, choices of the pay- 
off knob will be referred to in the re- 
mainder of this paper as correct re- 
sponses, regardless of whether or not 
the choice actually resulted in the de- 
livery of a marble.) Examination of 
each of the two percentage conditions 
separately revealed a significant differ- 
ence among the age groups in both the 
33% condition (F = 6.60, df = 6/163, 
p < .001) and the 66% condition (F
= 2.48, df = 5/114, p < .05). Due
to unequal and disproportionate sub- 
class numbers, both percentage condi- 
tions were not included in a single 
analysis of variance. A t test compar-
ing number of correct responses made
by subjects in the two percentage con-
ditions indicated that the terminal level
of response of subjects in the 66%
condition was significantly higher than
that of subjects in the 33% condition
(t = 8.18, df = 288, p < .001).

The terminal level of response across 
age groups, shown in Figure 1, ap- 
pears to be U-shaped in both percent- 
age conditions. A curvilinear regres- 
sion analysis was performed in order 
to test the linear and quadratic com- 
ponents of the two percentage curves. 
Since exact age data were not avail- 
able for all subjects in some age 
groups, the mean age of each group 
(estimated from available age data) 
was used in a method similar to that 
suggested by Snedecor (1946, pp. 383- 
384). From this analysis, the variance 
due to linear and quadratic components 
was computed, and the resulting F’s 
showed the quadratic component of the 
33% curve to be highly significant 
(F = 26.28, df=1/167, p<.001). 
The quadratic component of the 66% 
curve was also significant (F = 10.12, 
df = 1/117, p < .01). The linear com- 
ponent of neither the 33% nor the 
66% curve approached significance (F 
<1 in both cases). Also plotted in 
Figure 1 is the line of least squares 
derived from the linear and quadratic 
components of the regression analysis. 
In both percentage conditions, these 
least squares lines appear to fit the
data quite well. (For the remainder
of the paper, these least squares lines
will be referred to as the “fitted”
curves.)

One interesting aspect of this U-
shaped relationship is that 3-year-olds,
5-year-olds, and college students all
show similar levels of performance
after 80 trials in this task. There is
no question that different processes
must have led to this final level for
groups so discrepant in age. An ex-
amination of the learning curves of the
various age groups in the two percent-
age conditions may provide one indi-
cation that markedly different proc-
esses are operating.

FIG. 1. Per cent correct responses during the last 20 trials as a function of
chronological age and percentage of reinforcement. (Both data points and fitted
curves are shown.)2

2 Equations for the two fitted curves are as
follows:

33% condition:
Y = 99.95 - 10.20X + .455X^2

66% condition:
Y = 114.15 - 8.85X + .395X^2

where: Y = predicted per cent correct
X = chronological age.
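The shape of the two fitted curves can be checked by evaluating the quadratic equations reported for Figure 1. The Python sketch below is illustrative only; the coefficients are read from the printed equations (taking the constant of the 33% curve as 99.95 and the last term of each as a squared term, both readings being assumptions about a poorly reproduced footnote), and the function names are hypothetical.

```python
# Coefficients (intercept, linear, quadratic) as read from the printed
# equations for the fitted curves; the reading is an assumption.
FITTED = {
    "33%": (99.95, -10.20, 0.455),
    "66%": (114.15, -8.85, 0.395),
}


def predicted_percent_correct(condition, age):
    """Predicted per cent correct (Y) at a given chronological age (X)."""
    intercept, linear, quadratic = FITTED[condition]
    return intercept + linear * age + quadratic * age ** 2


def age_at_minimum(condition):
    """Age at which the fitted parabola reaches its lowest point."""
    _, linear, quadratic = FITTED[condition]
    return -linear / (2 * quadratic)


if __name__ == "__main__":
    for condition in FITTED:
        low = age_at_minimum(condition)
        print(condition, "minimum near age", round(low, 1),
              "predicted per cent correct there",
              round(predicted_percent_correct(condition, low), 1))
```

On this reading both parabolas reach their minimum at a little over 11 years of age, which is consistent with the U-shaped relationship described in the text.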


Group Learning Curves 


Figures 2 and 3 present the mean 
number of correct choices per block 
of 10 trials for the various age levels 
in the two percentage conditions. 
From these two figures it may be seen 
that the performance of the 3- and 5- 
year-olds is characterized by a rapid 
rise to asymptote, while that of the 
older subjects either shows little 
change or a slower rise to terminal 
level of performance. The percentage 
of subjects in each of the age groups 
displaying maximizing behavior (i.e.,
choosing the payoff knob at least 18 
out of 20 trials) during the final 20 
trials in the task may be seen in 
Table 2. 
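The maximizing criterion just stated (at least 18 choices of the payoff knob in the final 20 trials) can be expressed as a simple scoring rule. The Python sketch below is a minimal illustration with hypothetical function names and data layout; it is not the scoring procedure actually used by the experimenters.

```python
def is_maximizing(choices, payoff_knob, window=20, criterion=18):
    """True if the payoff knob was chosen on at least `criterion` of the
    final `window` trials of a subject's sequence of choices."""
    final_choices = choices[-window:]
    correct = sum(1 for knob in final_choices if knob == payoff_knob)
    return correct >= criterion


def percent_maximizing(subjects, window=20, criterion=18):
    """Percentage of subjects meeting the criterion.

    subjects is a list of (choices, payoff_knob) pairs, one per subject,
    since the correct knob differed from subject to subject.
    """
    if not subjects:
        raise ValueError("no subjects in group")
    n_maximizing = sum(
        1 for choices, knob in subjects
        if is_maximizing(choices, knob, window, criterion)
    )
    return 100.0 * n_maximizing / len(subjects)
```

Applied to each age group separately, a rule of this kind yields the kind of percentages summarized in Table 2.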

One possible explanation for these 
differences in trends may be the man- 
ner in which subjects at different ages 


approach this task. It is likely that 
the 3- and 5-year-olds are drawn to 
the payoff button on the basis of a 
simple reinforcement notion only. 
Their tendency to respond to the pay-
off button is strengthened by each
reinforcement, and the tendency to re-
spond to the other two buttons ex-
tinguishes rapidly in the face of sev-
eral nonreinforcements.

FIG. 2. Mean number of correct responses per block of 10 trials for the seven
age groups in the 33% condition.

In order to
invoke this explanation of the behavior
of younger children, it is necessary to
postulate that they do not entertain
complex hypotheses involving patterns
of response, nor are they particularly
disturbed by the partial reinforcement
schedule. That is, they may expect
neither complex nor “perfect” solu- 
tions in a task of this sort, and be- 
have accordingly. A similar result has 
been reported for children of about 
this age by Kendler and Kendler 
(1962). They conclude: “it was pos-
sible to infer that as a child matures
he makes a transition from responding
on the basis of a single unit S-R
mechanism to a mediational one.”

FIG. 3. Mean number of correct responses per block of 10 trials for the six
age groups in the 66% condition.

It may be that the single unit S-R mecha-
nisms present in the younger subjects
provide an inadequate base for com- 
plex strategies. Jones and Liverant 
(1960) report that younger subjects 
(4 to 6 years) showed more utilization 
of “pure” strategy (more maximiza- 
tion) in a two-choice task than did 
older subjects (9 to 11 years). In 
addition, their data clearly indicate 
a more rapid rise in choices of the 
more frequently reinforced stimu-
lus for the younger than for the
older children. In this case, as with
the present data, it appears that a pure
strategy occurring early in perform-
ance is indicative of fairly primitive
problem-solving behavior.

TABLE 2

PERCENTAGE OF SUBJECTS MAXIMIZING
DURING THE FINAL 20 TRIALS

Older subjects, particularly adults, 
enter this task with a strong expectancy 
that there is a solution which will yield 
100% reinforcement, or at least 100% 
predictability of when a reinforcement 
will be delivered and when it will not, 
and employ complex strategies based 
on complex hypotheses concerning the 
nature of the task and the rein- 
forcement schedule (Stevenson & 
Weir, 1963). Younger children, on 
the other hand, are not concerned with, 
or more likely not capable of, such 
complex mediating mechanisms, and 
do not respond on this basis. 

This expectancy for solution that 
adult subjects show so strongly was 
encountered by Hyman and Jenkins 
(1956) who report that it was much 
more difficult to convince subjects that 
a sequence was random than it was to 
convince them that it was structured. 
A number of investigators have at- 
tempted to manipulate this solution 
set in adult subjects by varying type 
of task or instructions. Goodnow 
(1955) found less maximizing be- 
havior in a problem solving than in a 
gambling (chance) task, and Morse 
and Runquist (1960) report that 
matching behavior (nonmaximizing) 
is more common in a task which the
subject believes to have been pre-
scheduled by the experimenter, while
more maximizing behavior was noted 
among subjects who knew that the 
series was random. A similar finding 
has been reported by Gruen and Weir 
(1964) using instructional sets con- 
cerning randomness or nonrandom- 
ness. 

It is interesting to note that in the
studies mentioned above, as well as
the present experiment, the belief by
older subjects that there is a complex 
solution actually results in fewer 
choices of the most frequently rein- 
forced alternative. As a consequence, 
those subjects behaving in the most 
complex fashion actually receive a lower 
frequency of reinforcement than do sub-
jects who are behaving in a less complex
fashion. The finding that older subjects 
tend to expect complex solutions, and 
are hampered in their problem solving 
by this set, is not without precedent. 
Kendler (1963) has reported her be- 
lief that in certain tasks, older sub- 
jects make the problem more complex 
than it really is, with this unnecessary 
complication leading to lowered per- 
formance. The same interpretation 
has been given to results of other 
studies using several age groups 
(Stevenson, Iscoe, & McConnell, 1955; 
Weir & Stevenson, 1959). Goodnow 
and Pettigrew (1956) propose a “re- 
sponse-hypothesis orientation” to ac- 
count for a similar difficulty in simple 
pattern learning. According to them, 
The Ss use their choices as direct tests 
of specific hypotheses rather than as tools for 
data gathering with hypothesis testing held 
in abeyance. As a result, the information 
gathered is related to a specific hypothesis, 
and if the latter should prove to be wrong, 
it is only with difficulty that the information 
can be transformed and made relevant to an- 
other hypothesis. As a rule, Ss do not 
transform information but start from scratch 
again with their next hypothesis [p. 385]. 
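The cost of complex responding noted above can be made concrete with a little arithmetic. The sketch below assumes, purely for illustration, that each choice of the single payoff button is reinforced independently with a fixed probability; the actual reinforcement schedule is not described in this section, and the function name is hypothetical.

```python
# Expected reinforcement per trial in a three-button task in which only one
# button ever pays off. Assumption (not from the source): each choice of the
# payoff button is reinforced independently with probability p_payoff.

def expected_reinforcement(p_payoff, share_on_payoff_button):
    """Expected proportion of reinforced trials.

    p_payoff: probability that a choice of the payoff button is reinforced.
    share_on_payoff_button: proportion of trials spent on that button.
    """
    return p_payoff * share_on_payoff_button


for p in (0.33, 0.66):
    maximizing = expected_reinforcement(p, 1.0)     # always the payoff button
    lmr_cycle = expected_reinforcement(p, 1.0 / 3)  # L, M, R, L, M, R, ...
    print(f"p = {p:.2f}: maximizing {maximizing:.2f}, LMR cycle {lmr_cycle:.2f}")
```

Under these assumptions a subject who cycles across the three buttons visits the payoff button on one trial in three, and so can expect only a third of the reinforcement earned by a subject who maximizes.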


In the present task it appears that 
even though the adult subjects may 
have been hampered by such complex 
hypotheses and strategies early in the 
task, they were nevertheless able to 
eventually arrive at a high terminal 
level of performance. There is little 
doubt that the frequency of correct 
responses in this oldest group would 
have continued to increase had they 
been given additional trials. It seems 
reasonable to assume, then, that the
older subjects (the 18-year-olds, for
example) enter this task with com- 
plex hypotheses concerning solution, 
employ complex strategies, systemati- 
cally reject these strategies when they 
do not provide a solution which meets 
their expectation, and finally arrive at 
something very close to an actual 
game solution. That is, they eventu-
ally begin to realize that two of the
buttons have nothing to do with the
game and begin to maximize their
choice of the only alternative that
ever pays off. But this maximizing
strategy comes about as the conse-
quence of a vastly different process
than does the maximization shown by
the 3- and 5-year-olds.

The data and speculations presented
above provide at least some manner of 
explanation for the behavior of the 
younger children (3- and 5-year-olds) 
and for the eventual maximizing tend- 
ency of the adults. It does not, how- 
ever, provide much of a clue as to 
what is occurring in the age groups 
between the two ends of the age con- 
tinuum. These “middle-aged” chil- 
dren are characterized by fairly low 
terminal levels of correct response and 
only slight increases in performance 
across trials (see Figures 1, 2, and 
3). More light might be cast on the 
behavior of this middle age range by 
examining directly some of the strate- 
gies which subjects use in this task. 


Response Patterns 


Previous experimental findings 
(Weir, 1962) have indicated that the 
most common response pattern used in 
this task (at least for some ages) in-
volves some variant of a left, middle,
right (LMR) or a right, middle, left
(RML) scheme. This type of pattern
represents a simple progressive search
across the three response buttons.

Figure 4 presents the mean number
of such patterns made during the
course of the experiment for the vari-
ous age levels in the two percentage
conditions, as well as the fitted curves
for the two percentage conditions.
Again, due to unequal and unpropor-
tional subclass n’s, the two percentage
conditions were not combined in a
single analysis. An analysis of vari-
ance performed separately on the two
percentage conditions revealed a sig- 
nificant difference among the age 
groups for both the 33% condition 
(F = 4.83, df = 6/163, p < .001) and 
the 66% condition (F = 5.07, df = 
5/114, p < .001). A t test comparing 
mean number of patterns for all 33% 
groups with all 66% groups indicated 
that subjects in the 33% condition 
made significantly more pattern re- 
sponses of an LMR and RML nature 
than did subjects in the 66% condition 
(t = 6.10, df = 288, p < .001).

Again, as with terminal level of per- 
formance, a curvilinear relationship 
between age and performance is indi- 
cated. In the case of LMR and RML 
patterns, however, this relationship 
takes the form of an inverted U. A 
curvilinear regression analysis indi- 
cated that the relationship between age 
and performance in the 66% condition 
is indeed in the form of an inverted
U, as the quadratic component was 
highly significant (F = 15.45, df = 
1/117, p < .001), while the linear com- 
ponent did not approach significance 
(F = 1.26). For the 33% condition, 
however, the linear component was 
significant (F = 13.40, df = 1/167, 
p < .001), while the quadratic com- 
ponent did not approach significance 
(F <1). A Duncan’s range test per- 
formed on the 33% data revealed the 
3-year-old group to differ significantly 
from the 5-, 7-, and 9-year-olds. This 
drop at the 3-year level was not suffi-
cient to result in a significant quad-
ratic component, however.

Fig. 4. Mean number of pattern (LMR and RML) responses made during the task as a function of age and percentage of reinforcement. (. . . and fitted curves are shown.)

3 Equations for the two fitted curves are as follows:
33% condition: $Y = 13.24 + .144X - .023X^{2}$
66% condition: $Y = -5.97 + 3.00X - .136X^{2}$
where $Y$ = predicted number of pattern responses and $X$ = chronological age.

From the statistical analysis pre-
sented above and from Figure 4, it
seems clear that patterns of this simple
LMR and RML nature are less likely
to occur during the 80 trials in the
younger and older groups than they
are in the middle age ranges. The
number of patterns made as each
group progressed in the task was ex-
amined, and from this it appears that
the 3-year-olds virtually never use
these types of patterns; the older sub-
jects use them some at the beginning
of the task, but they then drop out,
and the subjects in the middle age
groups tend to use these patterns
rather consistently throughout the en-
tire task. This indicates that the very
young children do not respond in this 
patterned way at all, and the older 
subjects, as was suggested earlier, are 
able to discard these simple patterns 
after the first time or two they are 
unsuccessful. 
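The shape of the two fitted curves reported for Figure 4 can be checked numerically. The following is a minimal sketch that takes the coefficients as they can best be read from the footnote to that figure; the function names and the ages printed are illustrative choices, not part of the original analysis.

```python
# Quadratic curves fitted to the mean number of LMR/RML pattern responses.
# Coefficients are taken from the footnote to Figure 4 as best they can be
# read; the function names are hypothetical.

def fitted_33(age):
    """Predicted number of pattern responses, 33% condition."""
    return 13.24 + 0.144 * age - 0.023 * age ** 2


def fitted_66(age):
    """Predicted number of pattern responses, 66% condition."""
    return -5.97 + 3.00 * age - 0.136 * age ** 2


def turning_point(linear, quadratic):
    """Age at which a + linear*X + quadratic*X**2 reaches its extremum."""
    return -linear / (2 * quadratic)


peak_33 = turning_point(0.144, -0.023)
peak_66 = turning_point(3.00, -0.136)
print(f"33% curve turns at about age {peak_33:.1f}")
print(f"66% curve turns at about age {peak_66:.1f}")

# Illustrative ages only; the exact age groups are not restated here.
for age in (3, 7, 11, 15):
    print(age, round(fitted_33(age), 2), round(fitted_66(age), 2))
```

On these coefficients the 66% curve peaks near age 11, in line with the inverted U described in the text, whereas the 33% curve turns near age 3 and declines over nearly the whole age range sampled, which fits the report of a significant linear but not quadratic component in that condition.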

The LMR and RML pattern data 
begin to throw some light on the per- 
formance of the children in the middle 
age ranges. Children approximately 
7 to 10 years of age respond in a 
highly stereotyped fashion in this task, 
employing large numbers of these 
simple patterns throughout the task. 
At least two explanations for this 
stereotyped responding appear pos- 
sible. Either these middle-aged chil- 
dren do not have more complex pat- 
terns to use when the simple ones fail, 
or they are for some reason unable 
to reject these simple patterns when 
they do not pay off and continue to
respond in a fairly stereotyped fash-
ion.

It is possible that both of these fac-
tors are at least partially responsible
for the behavior of the children in the 
middle age ranges. The first explana- 
tion, in effect, assumes a severe limita- 
tion on the hypothesis formation abil- 
ity of the child in the middle age 
range, while the second indicates that 
this middle-aged child may be capable 
of complex hypotheses, but he is un- 
able to make full use of the informa- 
tion available from his own respond- 
ing. This latter explanation would 
suggest that, in this task, the 7- to 
10-year-old is at a point in develop- 
ment where his ability to generate 
complex hypotheses and employ com- 
plex search strategies is growing at a 
faster pace than his information- 
processing ability, which catches up 
only at a later age. Some support for 
this notion can be found by examining 
the results of other investigators using 
similar tasks. 


Pattern Responding and Terminal 
Level of Responses—Data from Other 
Studies 


Most of the information from 
other studies which can be used for 
comparison purposes involves decision 
tasks using only two choices, both of 
which are partially reinforced. In 
this type of task, patterns of an LMR 
or RML nature are clearly impossible, 
and investigators report that simple 
alternation is the most frequently en- 
countered response pattern. If alter- 
nation in the two-choice task can be 
looked upon as the analogue of LMR 
and RML patterns in the three-choice
task, then some developmental com- 
parison is possible by combining find- 
ings from several recent studies. 

Information on alternation behavior 
is available for the two-choice task 
for 3- and 4-year-olds from Kessen 
and Kessen (1961), 5-, 9-, and 13-
year-olds from Craig and Myers
(1963), 6-, 9-, and 11-year-olds from 
Gratch (1959), 10-, 13-, and 15-year- 
olds from Ross and Levy (1958), 
and some recent data on 3- and 4- 
year-olds from our laboratory. Al- 
though each of these experiments used 
different tasks, and, in most cases, 
different reinforcement percentages, it 
is possible to examine the ordering 
among age groups within each of these 
studies as far as alternation behavior 
is concerned. Combination of these 
relative orderings may then give some 
picture of developmental differences 
in alternation behavior in the two- 
choice task. 

When the data are combined in this 
fashion, an inverted U function be- 
tween age and number of alternations 
is revealed. The shape of this curve 
is quite similar to that reported in 
the present study for LMR and RML 
patterns, with one important excep- 
tion: the apex of the inverted U for 
alternations appears to be at about 6 
to 9 years of age, while that for the 
LMR and RML patterns is at 9 to 11 
years of age (see fitted 66% curve, 
Figure 4). If this difference in the 
apexes of the two curves proves to 
be a replicable one, it may indicate 
that as the task gets less complex (re- 
duces from three to two choices), the 
age at which maximum stereotypy 
appears also decreases. This would 
provide support for the notion of dif- 
ferential growth of hypothesis forma- 
tion ability and information-processing 
capacity. In other words, the 10- and 
12-year-olds are capable, in the two- 
choice task, of processing the informa- 
tion they are receiving from the out- 
come of their alternation behavior and 
consequently reject alternation as a 
solution. In the three-choice task, 
however, the increasing complexity in-
volved in series of LMR and RML
patterns may take them beyond their
capacity to make use of the informa- 
tion they are gaining from their own 
responses. This inability to make 
maximum use of their previous re- 
sponse patterns may be mainly a func- 
tion of insufficient memory for fairly 
long series of responses and their out- 
comes which denote the success or fail- 
ure of such a series. If recall of long 
series is not possible, subjects may re- 
turn to the basic pattern in the series 
for a new attack, and this consistent 
return may produce the pattern stereo- 
typy noted. If inadequate recall is 
responsible for this stereotypy, allow- 
ing subjects of this age to record and 
keep track of previous responses and 
their outcomes should considerably re- 
duce the number of stereotyped re- 
sponse patterns. 

If the above analysis is correct, the 
data from the two-choice task should 
reveal a U-shaped relationship be- 
tween age and terminal level of per- 
formance, with the lowest point of the 
curve occurring at a younger age than 
was the lowest point of the U-shaped 
curve found in the three-choice task. 
Combining terminal level of response 
data from the studies mentioned above, 
with the addition of a study by Cran- 
dall, Solomon, and Kellaway (1961), 
reveals this to be the case. The lowest 
point of the terminal level of response 
curve for the two-choice data ap- 
pears to be at about 5 to 7 years of 
age, while Figure 1 shows the lowest 
point of the two fitted curves repre- 
senting terminal level of correct re-
sponse to be at 11 to 12 years of age.⁴


4 Results of a study by Jones and Liverant
(1960) appear to be slightly out of line with
the findings just presented. They found
maximizing by a group of subjects 4.5 years
of age, while children of 9 to 11 years of
age tended to probability match. The find-
ing of matching in the older children is in
accord with the results of Craig and Myers
(1963), but the maximizing tendency of the
younger children is in contradiction to the
findings of Kessen and Kessen (1961), and
recent results from our own laboratory.
One reason for this discrepancy may be that
Jones and Liverant used a task much dif-
ferent from those of the other investigators,
which may have affected the performance of
the younger children.

It should be noted at this point that
the two response measures which have
been discussed thus far (terminal
level of correct response and patterns
of response) are not necessarily inde-
pendent of one another. If the sub-
ject maximizes his choice of the most
frequently reinforced alternative, he
will be showing no LMR or RML
patterns. Conversely, if he is show-
ing a large amount of pattern be-
havior, his terminal level of correct
response will be low. An attempt
was made to take this into account by
examining the amount of response
variability accounted for by LMR and
RML patterns and expressing this as
a fraction of the total amount of re-
sponse variability. In doing this, it
was possible to examine only the re-
sponses subjects made which were not
a part of a maximizing strategy and
determine what proportion of those
variable responses were taken up by
patterns. In other words, if a sub-
ject maximized completely during the
entire last half of the task, this meas-
ure of patterning would take into ac-
count only the first half, and would
indicate the amount of patterning the
subject was showing when he did vary
his responses.

A variable response was scored any
time the subject chose an alternative
on Trial n+1 which was different
from his choice on Trial n. For each
subject, the total number of variable
responses was computed as well as
the number of variable responses
which were a part of LMR or
RML patterns. When the ratio of
variable responses in patterns to total 
variability was examined, curves 
nearly identical to those in Figure 
4 were the result. This indicates that 
the pattern of response data presented 
here does add information to that 
provided by the terminal level of re- 
sponse data, since the same inverted 
U relationship between age and per- 
formance was found irrespective of 
the amount of maximization. 
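The ratio of patterned to total variable responses described above can be sketched as follows. The definition of a variable response follows the text; the rule for deciding when a variable response is "a part of" an LMR or RML pattern is not spelled out in this section, so the window rule used here is an assumption made only for illustration, and the function name is hypothetical.

```python
# Ratio of variable responses that fall inside LMR or RML patterns.
PATTERNS = {("L", "M", "R"), ("R", "M", "L")}


def variable_response_ratio(choices):
    """Return (n_variable, n_in_pattern, ratio) for one subject's choices.

    A variable response is a choice on trial n+1 that differs from trial n.
    Assumption (not from the source): a variable response counts as part of
    a pattern when it lies inside any window of three consecutive choices
    that reads L-M-R or R-M-L.
    """
    variable = [i for i in range(1, len(choices)) if choices[i] != choices[i - 1]]
    in_pattern = set()
    for start in range(len(choices) - 2):
        if tuple(choices[start:start + 3]) in PATTERNS:
            # Both shifts inside the window are scored as patterned.
            in_pattern.update((start + 1, start + 2))
    n_variable = len(variable)
    n_in_pattern = len(in_pattern)
    ratio = n_in_pattern / n_variable if n_variable else 0.0
    return n_variable, n_in_pattern, ratio


# Hypothetical subject: two LMR sweeps, then maximizing on the middle button.
print(variable_response_ratio(list("LMRLMRMMMMMM")))
```

Because trials on which the subject simply repeats his choice contribute nothing to either count, a subject who maximizes for the last half of the task is scored only on the portion in which he actually varied his responses, which is the property the measure was designed to have.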


Effects of Reinforcement and Nonre- 
inforcement 


Thus far, examination of perform- 
ance in this task has taken into ac- 
count number of correct responses and 
patterns of response, and no mention 
has been made of the effect of a re- 
inforcement or a nonreinforcement on 
the subjects’ behavior. If hypotheses 
and strategies differ as a function of 
age, then the effect of a reinforcement 
or a nonreinforcement should also dif- 
fer with age. A reinforcement, for 
example, may be reacted to quite dif- 
ferently by a subject who is in the 
process of going through a complex 
pattern than it would by a subject who 
was not using a pattern strategy. In 
order to assess the immediate effect of 
a reinforcement or a nonreinforcement, 
the performance of subjects on the 
trial following a reinforcement or non- 
reinforcement was examined. Each 
subject was scored, on each choice of 
the correct knob, as having either re- 
peated that choice on the next trial 
or having switched to another knob. 
This tabulation was performed for 
the payoff knob only. 

Several types of strategies may be 
reflected in these data. For example, 
a subject who maximized his choice 
of the correct knob early in the task 
and continued this maximization would 


show repeated response to the correct 
knob following both reinforcement 
and nonreinforcement. This would be 
termed a win-stay, lose-stay strategy. 
On the other hand, a subject who was 
in the process of checking out a fairly 
complex sequential hypothesis might 
be virtually unaffected by the occur- 
rence of a single reinforcement or non- 
reinforcement, and might show any 
of a number of behaviors, depending 
on the nature of the sequential hy- 
pothesis involved. Other types of 
strategies such as win-stay, lose-shift, 
or win-shift, lose-shift, etc., would, if 
encountered, provide information con- 
cerning the effects of reinforcement 
and nonreinforcement on strategies 
used in this task. 

For each subject, the per cent re- 
sponse repetition was computed for all 
trials following either a reinforcement 
or a nonreinforcement. The average 
per cent response repetition for each of 
the age groups in the 33% condition is
plotted in Figure 5. In this graph, any 
point above the 50% line reflects a 
tendency to repeat the previous re- 
sponse, while any point below the 50% 
line is indicative of a tendency to switch 
to another choice. 
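The scoring of response repetition described above can be sketched as follows. The sketch scores only choices of the payoff knob, as the text specifies, and reports per cent repetition separately after reinforcement and after nonreinforcement; the function name, argument names, and sample data are hypothetical.

```python
# Per cent response repetition on the payoff knob following reinforcement
# and following nonreinforcement, for a single subject.

def repetition_after_outcome(choices, reinforced, payoff_button):
    """Return (per cent repetition after a win, per cent after a loss).

    choices: sequence of button labels, one per trial.
    reinforced: sequence of booleans, True when that trial was reinforced.
    Only trials on which the payoff button was chosen are scored, each by
    whether the same button was chosen again on the next trial.
    """
    counts = {True: [0, 0], False: [0, 0]}  # outcome -> [repeats, opportunities]
    for n in range(len(choices) - 1):
        if choices[n] != payoff_button:
            continue
        outcome = bool(reinforced[n])
        counts[outcome][1] += 1
        if choices[n + 1] == payoff_button:
            counts[outcome][0] += 1

    def percent(repeats, opportunities):
        return 100.0 * repeats / opportunities if opportunities else None

    return percent(*counts[True]), percent(*counts[False])


# Hypothetical eight-trial record with the middle button as the payoff knob.
choices = list("MMLMRMMM")
reinforced = [True, False, False, True, False, False, True, False]
after_win, after_loss = repetition_after_outcome(choices, reinforced, "M")
print(after_win, after_loss)
```

A value above 50 in either position indicates a tendency to stay with the payoff knob after that outcome, and a value below 50 a tendency to shift, so the pair of values identifies win-stay, lose-shift and the other strategies named in the text.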

A Lindquist Type I analysis of vari- 
ance was performed on the data pre- 
sented in Figure 5. The results of 
this analysis point up several interest- 
ing features of these data. First, the 
main effect due to age is significant 
(F = [value illegible], df = 6/163, p < .001).
There is also a tendency through age 
9 for the groups to show more re- 
sponse repetition following a reinforce- 
ment than following a nonreinforce- 
ment, while at age 11 and beyond, 
this tendency is reversed. This re- 
versal between ages 9 and 11 is re- 
flected in the analysis of variance by 
a significant age X reinforcement-non-
reinforcement interaction (F = 6.86,
df = 6/163, p < .001). The main ef-
fect due to reinforcement-nonrein-
forcement was not significant (F < 1).

Fig. 5. Per cent response repetition following reinforcement and nonreinforcement for the seven age groups in the 33% condition (all 80 trials).

It appears then, that the younger 
children (3- and 5-year-olds) show 
evidence of a maximizing strategy, 
with a tendency to repeat their pre- 
vious response regardless of whether 
it was correct or incorrect. This is 
especially true of the 3-year-olds. The 
7-, 9-, 11-, and 15-year-olds, however, 
show a tendency toward a win-shift, 
lose-shift strategy. This probably 
comes about as a result of their pre- 
viously noted stereotyped pattern re- 
sponding, during which the occurrence 
or nonoccurrence of a reinforcement 
would have little effect on their be- 
havior. It also indicates that their 
pattern responding does not contain 
a large amount of repetition on the 
payoff knob, but instead is best repre- 
sented by some sequence of responses 


involving the other two knobs to a 
large degree. A strategy employing 
LMR and RML patterns would fit this 
definition and yield a win-shift, lose-
shift outcome.

The greater tendency to repeat the 
previous response following a rein- 
forcement than following a nonrein- 
forcement for children through age 
9 is in agreement with results re- 
ported by Brackbill, Kappy, and Starr 
(1962) and Stevenson and Odom 
(1963). The reversal in this behavior 
which occurs at age 11 and older may 
reflect the fact that older subjects are 
adopting a rather subtle strategy based 
on the absence of long runs of rein- 
forcement in the 33% condition. Since 
this knob pays off on the average of 
only one out of every three times, 
following a nonreinforcement a payoff 
becomes more likely to occur on the 
next trial. Conversely, if it has just
paid off, a nonreinforcement is more
likely to occur on the next trial than
a reinforcement. This type of
strategy may be too complex to be
adopted by the younger children.
Goodnow and Pettigrew (1955) report
a similar finding with adults trained
in a two-choice task with only short
runs of reinforcement possible. Their
subjects showed a marked tendency
to persist with an unsuccessful choice.
The same tendency has been reported 
by Jarvik (1951). 

In order to examine changes which
might occur in the effects of rein-
forcement and nonreinforcement as the
task progressed, per cent response
repetition during the first and last
blocks of 10 trials only was computed,
and is presented in Figure 6. Al-
though all groups reflect at least some
tendency to increase response repeti-
tion following both reinforcement and
nonreinforcement as the task prog-
ressed, the greatest change is shown
by the adult group. In order to test
the significance of this and other
changes, difference scores were com-
puted for each subject by subtracting
the per cent response repetition dur-
ing the first half of the task from per
cent response repetition during the last
half. A Lindquist Type I analysis of
variance was then performed on these
difference scores. This analysis was 
performed on first and last half of the 
task rather than on first and last block 
of 10 trials in order to include a 
greater number of responses for each 
subject from which the percentage fig- 
ures were computed. In the case of 
the 33% condition, this analysis re- 
vealed a significant change in response 
repetition associated with age (F =
3.56, df = 6/163, p < .01), indicating
all ages did not change the same
amount during the task. The change
in response repetition from first to
last half of the task was also differ-
ent for responses following reinforce-
ment than it was for those following
nonreinforcement (F = 5.12, df =
1/163, p < .05). The interaction be-
tween age and reinforcement-nonrein-
forcement was not significant (F < 1).

Fig. 7. Per cent of response repetition following reinforcement and nonreinforcement for the six age groups in the 66% condition (all 80 trials).

Figure 7 presents percentage of re-
sponse repetition following both rein-
forcement and nonreinforcement for the
six age groups of the 66% condition.
As in the 33% condition, the difference
among the age groups was highly sig-
nificant (F = 4.29, df = 5/114, p <
.001). From Figure 7 it can be seen
that the tendency to repeat the pre-
vious response was greater following
reinforcement than nonreinforcement
for all ages. This difference accounts
for the significant F associated with 
reinforcement-nonreinforcement (F = 
28.95, df = 1/114, p < .001). This 
tendency is considerably greater for 
the 7-, 9-, and 13-year-olds than it is 
for the other ages, resulting in a sig- 
nificant age X reinforcement interac- 
tion (F = 2.61, df = 5/114, p < .05). 
As with the 33% condition, the 3- 
and 5-year-olds and adults in the 
66% condition showed an overall tend- 
ency to continually choose the payoff 
button. The same strategy has been 
reported for adults by DeSoto, Cole- 
man, and Putman (1960). They found 
adult subjects to show more win-stay 
than lose-stay, but also report a pre- 
dominant tendency to stick with the 
most frequently reinforced alternative 
after either a win or loss. Hyman 
and Jenkins (1956) report a similar 


finding. The adults may be adopting 
this strategy in response to the fairly 
long runs of reinforcement which oc- 
cur in the 66% condition. Goodnow 
and Pettigrew (1955) were able to 
train this sort of strategy into their 
subjects by means of a reinforcement 
schedule characterized by long runs. 
The same sort of behavior has been 
described by Anderson (1960) and by 
Derks (1963). 

The finding of a win-stay, lose- 
shift strategy on the part of the chil- 
dren in the middle age ranges agrees 
with the results of Brackbill, Kappy, 
and Starr (1962) and Stevenson and 
Odom (1963). Since the LMR and 
RML pattern data have indicated that 
these same subjects are responding in 
a fairly stereotyped fashion, it appears 
that they not only display pattern 
stereotypy, but also, during their pat-
tern sequences, tend to repeat the pre-
viously reinforced response following
a reinforcement, and to shift follow- 
ing a nonreinforcement. A careful 
check of individual data sheets shows 
this to be the case. One discrepant 
finding has been reported by Cohen 
and Hansel (1955). They report that 
6- to 8-year-olds show more win-shift, 
lose-stay strategy, while older sub- 
jects (10 to 11 years of age) were 
not affected by the outcome of their 
previous response. The source of this 
discrepancy in results is not clear. 
As with the 33% condition, response 
repetition following both reinforce- 
ment and nonreinforcement in the 
66% condition was computed for the 
first and last block of 10 trials sepa- 
rately in order to examine changes in 
strategy as the task progressed. These 
are presented in Figure 8. Again, 
there is a general tendency for re- 
sponse repetition to increase from first 
to last block for all ages. Certain of 
the age groups show more change than 
others, particularly the 7- and 9-year- 
olds and adults, who change most 
drastically in the direction of maximi- 
zation. This differential change 
among the age groups is the source 
of a significant F associated with age
in the analysis of change scores from
the first to the last half of the task
(F = 2.66, df = 5/114, p < .05). No
other significant differences were re-
vealed by this analysis.

One other performance measure 
seems worthy of comment. In order 
to assess the degree of tendency to- 
ward response repetition which sub- 
jects bring with them into this task, 
the proportion of subjects at each 
age level who repeated their previous 
response after the first reinforcement 
or nonreinforcement was calculated. 
These proportions are plotted in Fig- 
ure 9. These are first trial data only 
(or, more precisely, second trial data, 
since they represent what the subject 
did following his first reinforcement 
or nonreinforcement), and represent 
the sets and tendencies the subjects 
bring with them to the experimental 
situation or arrive at through the in- 
structions given. These data present 
a remarkably similar picture to those 
presented in Figures 5 and 6, indi- 
cating that the 33% reinforcement 
schedule may not have been sufficient 
to produce much change in the strate- 
gies involving reinforcement and non- 
reinforcement. It should be noted 
that the data presented in Figure 9
are not strictly comparable to those
in Figures 5 and 6. The data in the
latter two figures represent per cent
response repetition shown by groups
of subjects summed over a series of
trials, while the data in Figure 9 are
simply proportions of subjects within
each of the age groups who show re-
sponse repetition.

Fig. 9. Percentage of subjects repeating same response following first reinforcement or nonreinforcement. (All percentage conditions combined.)


Concluding Remarks 


This paper has been an attempt to 
analyze in a fairly extensive fashion 
the performance of subjects at various 
age levels in a problem-solving task, 
and to compare these findings with 
those of other investigators who have 
used comparable tasks. Throughout, 
it has been assumed that differences 
in performance at different age levels 
or at different points during the task 
reflect hypotheses and strategies on the
part of the subjects. Since these hy-
potheses were not measured directly,
they can only be inferred in a general 
way from the performance data, and 
their actual existence remains to be 
demonstrated by further research. 
This may be impossible for the 
younger subjects, who are noto- 
riously uncommunicative. Unfortu- 
nately, these subjects also represent 
an age range in which it appears that 
the greatest changes, probably associ- 
ated with rapid language development, 
are taking place. However, by 
manipulating such variables as type 
of instruction, pretraining, or incentive 
condition, it should be possible to dem- 
onstrate predictable changes in per- 
formance which would be stronger in- 
dicators of hypotheses and strategies. 

Another limitation to the data 
which have been presented in this
paper is that the strategies discussed
were derived from information based
on group performance. Obviously, not
all subjects within each of the age 
groups went about solving this prob- 
lem in the same fashion. However, 
further research which would include 
such individual difference variables as 
intelligence and certain measures of 
parent-child interaction should make 
the prediction of individual perform-
ance in this task possible.

This paper proposes a taxonomy of information-processing tasks.
Information conserving, reducing, and creating operations are viewed 
as different methods of processing. The main concern of this paper 
is information reduction which, it is suggested, represents a kind of 
thinking in which the solution is in some way implicit in the prob- 
lem, but in which the input information must be reflected in a re- 
duced or condensed output. A number of tasks within the areas of 
concept identification and utilization are shown to have this character.
If the tasks require complete representation of the stimulus in the 
response (condensation) the amount of information reduced is di- 
rectly related to difficulty both during learning and in utilization of 
previously learned rules. If the tasks allow Ss to ignore informa- 
tion in the stimulus (gating) the direct relation between reduction 
and difficulty is found during learning but may not occur after the
rule is learned.


Most applications of information the- 
ory to psychology have been concerned 
with tasks which require information 
to be conserved between input and out- 
put. In the standard reaction-time
task the S is supposed to effect an en- 
ergy and location change, but must 
preserve all the information in the in- 
put for errorless performance (Bricker, 
1955a). Verbal-learning experiments 
add to these requirements the neces- 
sity for S to serve as a hold circuit 
and thus delay the response a speci- 
fied period of time. In studies of the 
limits of discriminability S is required 
to preserve in so far as possible all 
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stimulus differences in his judgment, 
the failure to do so is recorded as an 
error (Miller, 1956). In each of these 
tasks any change in the informational 
content during message transmission 
is clearly error. As useful as these 
applications have been it is increas- 
ingly apparent that this is not the sole 
nor even the typical information-proc- 
essing situation for the human. Thus, 
Bruner (1957) suggests that “the 
most characteristic thing about mental 
life is that one constantly goes beyond 
the information given [p. 41].” With 
the reverse emphasis, Gerard (1960) 
suggests that “the real skill of the tal- 
ented thinker is in discarding irrele- 
vancies. What is omitted in percep- 
tion, memory and reasoning is of the 
highest moment [p. 1939].” Attneave 
(1962) has been somewhat more ex-
plicit: “most of the information that
goes into the individual never comes
out again. The information that is
lost is not necessarily wasted, how-
ever. The situation is somewhat like
that of an executive who considers a
mountain of data . . . in order to ar-
rive at a one bit decision [p. 634].” It
is with these omissions, condensations,
and reductions that this paper will be
concerned. It is first useful to outline
a general taxonomy of information-
processing tasks within which the de-
velopment of empirical relations can
proceed.

Fig. 1. A proposed taxonomy of information-processing tasks.


Taxonomy of Tasks


Figure 1 illustrates a general tax-
onomy for information-processing
tasks which includes not only the con-
servation tasks but also information reduction
and information creation. This classi-
fication considers only the input and
output information required for per- 
fect performance. A conservation task 
such as the standard choice reaction 
time situation, for example, requires 
S to preserve all the stimulus infor- 
mation for perfect performance. In 
actual practice he may reduce information
in this task by stimulus equivocation,
but such a reduction is clearly
an error as indicated in Figure 1. In 
a reduction task, however, S is re- 
quired to map more than one stimulus 
point into a single response. The loss 
of information in this situation is not 
error, but is required by the nature of 
the task. For example, the sum of a 
set of numbers has less information 
than its components, yet in losing this 
information, S has clearly not made an
error, but rather has accomplished his 
task. In the final logical category, the 
creation task, S is required to map a 
single stimulus point into more than 
one response. Multiple association is 
of this character, but the empirical 
analysis of this type of processing is 
beyond the scope of this paper. Both 
the reduction and the creation model 
seem to capture something of the cog- 
nitive aspects of human activity. 
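
As a rough illustration of the three logical categories, the following Python sketch labels a task from the stimulus-to-response mapping required for perfect performance. The function name classify_task, the dictionary representation, and the extra "mixed" label are assumptions made for this example only; they are not part of the taxonomy in Figure 1.

```python
def classify_task(mapping):
    """Label a task by its required stimulus-to-response mapping.

    `mapping` (a hypothetical representation) sends each stimulus to the
    set of responses that count as correct for it.
    """
    # Creation: a single stimulus point must be mapped into more than one response.
    one_to_many = any(len(responses) > 1 for responses in mapping.values())

    # Reduction: more than one stimulus point must be mapped into a single response.
    stimuli_per_response = {}
    for stimulus, responses in mapping.items():
        for response in responses:
            stimuli_per_response.setdefault(response, set()).add(stimulus)
    many_to_one = any(len(s) > 1 for s in stimuli_per_response.values())

    if one_to_many and many_to_one:
        return "mixed"  # not a category in the text; added for completeness
    if one_to_many:
        return "creation"
    if many_to_one:
        return "reduction"
    return "conservation"


# Choice reaction time: every stimulus has its own response.
print(classify_task({"light1": {"key1"}, "light2": {"key2"}}))
# Summing a set of numbers: different sets share one sum.
print(classify_task({(1, 4): {5}, (2, 3): {5}, (1, 1): {2}}))
# Multiple association: one word, several acceptable associates.
print(classify_task({"table": {"chair", "cloth"}}))
```

The labels depend only on the required input and output, not on what S actually does, which matches the way the classification is defined above.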


Previous Classifications 


The necessity for a taxonomy of this 
sort has become increasingly apparent 
in the last few years. Studies which 
place emphasis on many to one (Morin, 
Forrin, & Archer, 1961) or one to 
many (Morin & Forrin, 1963) map- 
pings of stimuli into responses have 
revealed limitations to the generaliza- 
tion so useful in conservation tasks, 
that task difficulty increases with infor- 
mation transmitted (Bricker, 1955a). 
Campbell (1958) noted that Ss tend 
to show systematic errors in information
transmission which may make important
additions to the message or
which may reduce information through 
classification. Garner (1962), in his 
discussion of concept-formation tasks, 
points out that since these require 
stimulus equivocation the normal re- 
lation between total and internal con- 
straint which exists for conservation 
tasks is changed. Toda (1963) in re- 
viewing Garner’s book recognizes the 
basic nature of this distinction for task 
analysis and calls the conservation
tasks “normal.” Hunt (1962) pro- 
poses that learning tasks can be di- 
vided into those which require com- 
plete information transmission (rote 
learning), information reduction (con- 
cept learning), and information pro- 
duction (probabilistic learning). This 
type of classification system captures 
something of Bartlett’s (1959) verbal 
distinction between “closed-system 
thinking” in which the solution is im- 
plicit in the problem and “open-sys- 
tem thinking” in which S uses the 
available evidence to leap beyond the 
input and provide a creative solution. 
It has a lesser analogy with the clas- 
sical division of the thought processes 
into productive and reproductive 
(Wertheimer, 1959). 


Laws of Information Reduction 


A classification system is of little 
value unless it can be shown that tasks 
falling within a category obey the 
same general laws. The study of in- 
formation-conservation tasks has been 
useful mainly because performance has 
been shown to have systematic rela- 
tions to information transmitted. While 
these relations are by no means per- 
fect, for example memory tasks show 
relatively little effect of information 
per symbol (Miller, 1956), they have 
provided important basic generaliza- 
tions from which more specific models 
have been formulated (Fitts, Peterson,
& Wolpe, 1963).
This paper proposes that for informa- 
tion reduction tasks difficulty is di- 
rectly related to the difference be- 
tween stimulus and response informa- 
tion. The author has chosen to call 
this difference the information reduced.
If stimulus uncertainty is held
constant this quantity is inversely related to
information transmitted; if response uncertainty
is held constant and stimulus
uncertainty varied it is, of course,
independent of information transmitted.
The information reduced is equivalent 
to amount of stimulus equivocation in 
conservation tasks, but a distinction is 
made between equivocation introduced 
by Ss’ errors and reduction of informa- 
tion required by the nature of the task. 
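
The definition of information reduced can be made concrete with a short Python sketch. It assumes equally likely stimuli and errorless performance on a many-to-one mapping, so that response information is simply the entropy of the category sizes; the function names and the eight-stimulus example are illustrative assumptions, not taken from any of the studies cited.

```python
from collections import Counter
from math import log2


def entropy(counts):
    """Shannon entropy in bits of a distribution given as raw counts."""
    total = sum(counts)
    return -sum((c / total) * log2(c / total) for c in counts if c > 0)


def information_reduced(mapping):
    """Stimulus information minus response information, in bits.

    `mapping` sends each (equally likely) stimulus to its one correct
    response; errorless performance is assumed.
    """
    stimulus_info = log2(len(mapping))
    response_info = entropy(list(Counter(mapping.values()).values()))
    return stimulus_info - response_info


# Eight hypothetical stimuli sorted into 8, 4, or 2 equal-sized categories.
for n_categories in (8, 4, 2):
    task = {stimulus: stimulus % n_categories for stimulus in range(8)}
    print(n_categories, "categories:", information_reduced(task), "bits reduced")
```

With eight stimuli the three mappings reduce 0, 1, and 2 bits respectively, so the conservation task is simply the special case in which nothing is reduced.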
This simple statement of the rela- 
tion between information reduction 
and performance allows a variety of
experimental situations to be seen as
related through the requirement for 
Ss to operate on input information to 
produce a condensed response. Within 
a restricted task configuration it al- 
lows quantitative predictions which 
will be evaluated in this paper and 
which should serve to generate new 
experiments. It is, of course, true that 
this simple statement does not exhaust 
our ability to predict, any more than 
the relation between information trans- 
mitted and reaction time completes our 
knowledge of that task. Within re- 
stricted categories more specific mod- 
els such as the Bourne-Restle model 
for concept identification may do much 
better. However, this view provides a 
basic framework within which more 
specific formulations can be developed. 


Organization of the Paper 

This paper will examine first those 
information-handling tasks which in- 
volve the utilization of rules already 
well learned by S. Under this general 
heading both tasks which allow S to 
ignore aspects of the stimulus (gating) 
and those that require all aspects of 
the stimulus to be represented in the 
response but in a condensed form 
(condensation) will be considered. 
The second major area of analysis will 
concern classification tasks which re- 
quire the S to acquire or learn the use 
of rules. The paper examines two 
types of these tasks, first, those which 
are quite similar to rote learning ex- 
cept that more than one stimulus has 
the same response (unidimensional)
and second, those that meet the full 
requirements of concept learning by 
use of multivariate stimulus materials. 
Within each of these subareas both 
rules allowing gating and those requir- 
ing condensation will be examined. 


INFORMATION HANDLING 


In attempting to account for com- 
patibility effects in reaction time tasks 
Fitts (1959) speculated that, 


as the number of intervening information 
transformations increases, or as the number 
of hierarchical systems involved in the activ- 
ity increases, the greater, on the average, is 
the time required to complete an information- 
handling task and the greater is the probabil- 
ity of errors [p. 44]. 


Fitts suggests that such a speculation 
is not testable by direct observation. 
This is clearly the case for the infor- 
mation-conserving tasks which he was 
considering, since the information in 
the input and output are the same and 
the experimenter must speculate with 
respect to intervening transformations. 
In an information-reduction task, how- 
ever, the stimulus and response infor- 
mation is different and the experi- 
menter can use the size of transforma- 
tion between input and output as a 
directly observable property of the 
task. 
In 1957 Pierce and Karlin studied 
a task in which Ss were required 
either to read aloud a name or to clas- 
sify it into two categories, male or fe- 
male in one condition and animal or 
vegetable in another, by pressing a 
key. When there were only two names 
the key press was slightly faster than 
the verbal response. However, as the 
population of words increased in size 
the binary categorization required 40% 
longer than the naming response. 
The information transmitted by the 
classification task is only 1 bit regardless
of the size of the population of individual
stimuli, whereas the amount
of information transmitted under the 
information conserving conditions in- 
creased monotonically with the num- 
ber of individual stimuli. The authors 
conclude that the classification must 
have taken additional processing time, 
but did not follow up this inversion of 
the general rule of increasing reaction 
time with information transmitted. 

Crossman (1953) studied card sort- 
ing under a number of conditions. 
One condition involved sorting the 
cards into two categories, red and 
black; a second condition required the 
S to sort into any of four categories, 
red face, black face, red non-face, 
black non-face; while the third condi- 
tion required the S to sort red face 
and black non-face into one category 
and the reverse into a second category.
Each of these tasks involves a
loss of information since they require
classification into fewer categories than
the information available in the stimu- 
lus would allow. The mean sorting 
time per card for the one S studied 
was 0.7 second for condition one, 1.1
seconds for condition two, while con- 
dition three initially required 1.4 sec- 
onds and only after some training ap- 
proached the speed of condition two. 
Thus whether transmission of 1 bit is 
faster or slower than 2 bits depends 
upon whether S is allowed to ignore 
the face versus non-face distinction of 
the cards as he is in condition one or 
whether he is required to process that 
information in making his classifica- 
tion as he is in condition three. 


Gating 


We will first look for additional re- 
sults in reaction time studies which 
resemble condition one in the Cross- 
man study. Gregg (1954) and Archer 
(1954) have studied this type of task. 
In both situations Ss had to classify 
one or more binary relevant dimensions,
while ignoring one or more
binary irrelevant dimensions. In both 
situations Ss knew the rules which al- 
lowed them to determine whether or 
not the information was relevant. 
Gregg (1954) found a small but sig- 
nificant linear increase in reaction time 
(RT) with the addition of from 1 to 3
bits of irrelevant information. The 
size of the increase depended, in part, 
on the degree of compatibility between 
the relevant stimulus and its response. 
Archer, however, found no increase in 
RT due to the addition of up to two 
irrelevant dimensions. 

In this type of situation S can ignore
large amounts of stimulus information
with little or no penalty in RT.
What penalty there is appears to be a 
linear function of the amount of irrelevant
information. Both of these studies
require S to ignore entire dimensions.
When this is the case the way
in which the dimensions are separated 
is probably of importance. For exam- 
ple, studies which present relevant in- 
formation to the eye and irrelevant to 
the ear (Broadbent, 1958) and in 
which S knows to attend to the visual 
input usually show little decrement 
due to increased irrelevant informa- 
tion. Obviously this would be even 
more striking with the opposite cod- 
ing, when S could merely shut his eyes 
as a direct physical gate. The studies 
of Gregg (1954) and Archer (1954) 
also involve ignoring entire dimen- 
sions, but these dimensions overlap 
spatially with the relevant ones. 

Situations which allow Ss to reduce 
information by ignoring aspects of the 
stimulus will be called gating tasks. 
The literature indicates that those 
tasks which allow gating of entire di- 
mensions show little or no increase in 
difficulty with increasing irrelevant in- 
formation, once the gating rules are 
well known. Tasks which require gating
within a dimension may show a
more marked increase in difficulty
with increasing irrelevant information
(Rabbitt, 1963) though the evidence
here is hardly complete. In a sense the
limiting case is composed of those
tasks which, like Pierce and Karlin’s
and Crossman’s condition three, do
not allow S to ignore any aspects of
the stimulus, but require him to represent
it in a reduced or condensed
form. These tasks are conceptually
different from the gating tasks, although
they still involve information
reduction and will be called condensation
tasks.


Fig. 2. . . . reaction-time study (after Bricker, 1955b).


Condensation 


Bricker (1955b) presented Ss with 
eight stimulus patterns which had to 
be classified into eight, four, or two 
response categories. Thus he consid- 
ered, in addition to the normal infor- 
mation conserving situation, a 1- and 
2-bit reduction task. However, the 
particular coding in the Bricker study 
(Fig. 2) makes it difficult to be sure 
whether or not this can be considered 
a type of gating task. The eight 
stimuli could easily be recoded, as sug- 
gested by the lines inserted in Figure 2, 
into four perceptually distinct shapes 
ignoring differences in location and 
spacing of the elements: curved outward,
curved inward, straight, and
wedge, or into two shapes, curved and 
straight. These codes correspond ex- 
actly to the divisions which Bricker 
made in the response classes. If S can 
ignore detailed differences between in- 
dividual patterns this task would be ex- 
pected to be somewhat like three conser- 
vation situations with 1, 2, or 3 bits 
transmitted (Archer, 1954; Gregg, 
1954) and to follow the usual linear 
relations between RT and information 
transmitted. Indeed, this is the re- 
sult. It is interesting to speculate on 
what would have happened if all pos- 
sible codes were used. One might ex- 
pect that most would provide less op- 
portunity for S to ignore stimulus ele- 
ments in making the classification and 
the results would become similar to 
Crossman’s condition three and the 
Pierce and Karlin study. In addition, 
the Bricker study confounds amount 
of information reduced with changes 
in response uncertainty. It would be 
useful to separate the two effects. 
Posner (1962) has used a technique 
which allows the manipulation of 
amount of information reduced free 
from changes in response uncertainty 
and with systematic variation of the 
perceptual properties of forms within 
a response class. Original patterns of 
eight dots and distortions of these patterns
were constructed by means of
statistical rules so that a level of uncertainty
between an original and its
distortions could be stated. The S saw
two patterns at a time and was required
to state whether they were
members of the same class or not.
Prior to this procedure the Ss had
gone through a rather lengthy learning
process where they had learned to
give pairs of patterns of varying levels
of uncertainty the same names.
The results of this learning procedure
will be discussed later, but crucial here
are the results of the average reaction
times when two patterns were exposed
which had the same name and when S
was correct. These data are shown in
Figure 3. The results show reaction
time to be linearly related to the uncertainty
between patterns at least up
to 40 bits. The greater the uncertainty
which must be smeared over or reduced
in making the classification, the
greater the classification time.


Fig. 3. Reaction time as a function of
amount of information reduced.

Morin, Forrin, and Archer (1961) 
describe an experiment important in 
this analysis. The stimuli are slides 
with either one or two squares or one 
or two circles. Four conditions are 
crucial: conservation with two stimuli 
(1 bit), conservation with four stimuli 
(2 bit), gating in which the four 
stimuli are classified into circles or 
squares, and condensation where the 
response classes are two circles-one 
square and one circle-two squares. 
The initial level of difficulty as shown 
by RT indicates that the gating task 
was identical with the 1-bit conserva- 
tion condition, while the condensation 
task was much slower than these but 
slightly faster than the 2-bit conservation
task. Fitts and Biederman (1964)
first replicated the experiment exactly
and then added a new stimulus-response
(S-R) code which they had found in prior
testing to be more compatible. Their results
closely replicated the original study
in the initial difficulty except that with
the new S-R code the reduction task
took slightly longer than the 2-bit conservation
task. In the Fitts and Biederman results,
unlike the original study, the condensation
task continues to be of equal or greater
difficulty than the 2-bit conservation
task over the full 2 days of training.
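
The difference between the gating and condensation conditions of this design can be sketched in Python. The sketch assumes the four slides are equally likely and treats each slide as a (shape, number) pair; the response labels "A" and "B" and all function names are hypothetical. For each rule it reports the information reduced and how much information each single stimulus dimension shares with the response.

```python
from collections import Counter
from itertools import product
from math import log2


def entropy(labels):
    """Shannon entropy in bits of equally weighted observations."""
    counts = Counter(labels)
    total = sum(counts.values())
    return -sum((c / total) * log2(c / total) for c in counts.values())


def shared_information(xs, ys):
    """I(X;Y) = H(X) + H(Y) - H(X,Y) for paired observations."""
    return entropy(xs) + entropy(ys) - entropy(list(zip(xs, ys)))


# One or two squares, one or two circles.
STIMULI = list(product(["circle", "square"], [1, 2]))


def gating(shape, number):
    # Classify by shape; the number of figures can be ignored.
    return shape


def condensation(shape, number):
    # Two circles or one square -> "A"; one circle or two squares -> "B".
    return "A" if (shape, number) in {("circle", 2), ("square", 1)} else "B"


def profile(rule):
    """Return (bits reduced, bits shared with shape, bits shared with number)."""
    responses = [rule(shape, number) for shape, number in STIMULI]
    shapes = [shape for shape, _ in STIMULI]
    numbers = [number for _, number in STIMULI]
    return (
        entropy(STIMULI) - entropy(responses),
        shared_information(shapes, responses),
        shared_information(numbers, responses),
    )


print("gating:", profile(gating))
print("condensation:", profile(condensation))
```

Under these assumptions both rules reduce 1 bit, but in the gating rule shape alone determines the response, whereas in the condensation rule neither dimension taken alone carries any information about the response. That is one way of stating why S cannot ignore any aspect of the stimulus in the condensation task.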

The results reviewed so far indicate 
that a condensation task will be much 
more difficult in terms of processing 
time than the conservation task with 
the same information transmitted 
(Crossman, 1953; Morin, Forrin, & 
Archer, 1961; Pierce & Karlin, 1957), 
and will, after training, approach about 
the same level of difficulty as a conser- 
vation task with the same stimulus un- 
certainty (Crossman, 1953; Fitts & 
Biederman, 1964). Gating tasks, how- 
ever (Archer, 1954; Bricker, 1955b; 
Crossman, 1953; Morin, Forrin, & 
Archer, 1961) show little or no in- 
creased difficulty over their conserva- 
tion counterpart with the same infor- 
mation transmitted. Only in the Pos- 
ner (1962) study, in which stimulus 
similarity is defined in terms of un- 
certainty, has a quantitative relation- 
ship been discussed between the 
amount of information reduced and 
reaction time for a condensation task. 
In that study information was varied 
by systematic increases in stimulus un- 
certainty with response uncertainty 
constant. 

Pollack (1963) has developed the 
basic paradigm of Pierce and Karlin 
(1957) into a systematic study of the 
relationship of amount of information 
reduction to speed of classification. 
Pollack used 24 generic word cate- 
gories each having up to 24 individual 
instances, such as goat and pig as in- 
stances of animal. In the main condi- 
tions of the study Ss were required to 
classify 48 instances into from 2 to 24 
categories. When there is only a sin- 
gle instance per category the task is 
information conserving. The results
of these conditions show the expected 
linear relation between information 
transmitted and classification time 
(Bricker, 1955a). However, with the 
number of response categories fixed 
there is also a direct and reasonably 
linear relation between amount of in- 
formation reduction, from 1 to 4.5 bits 
per category, and classification time. 
The slope of this function increases as 
the number of response categories in- 
creases. At low levels of response un- 
certainty the effect of going from one 
instance per category (conservation) 
to two instances (condensation) is 
much greater than the general straight 
line trend due to amount of reduction, 
but the reverse is true at higher levels 
of uncertainty. 
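
The range of information reduction quoted for this design follows from simple arithmetic, sketched below in Python. The sketch assumes equally likely instances and equal numbers of instances in every category; the function name is a hypothetical one chosen for this example.

```python
from math import log2


def bits_reduced_per_category(instances_per_category):
    """Information reduced when equally likely instances share one response."""
    return log2(instances_per_category)


# One instance per category is the conservation case; 24 is the largest
# number of instances per category mentioned in the text.
for m in (1, 2, 4, 8, 24):
    print(m, "instances per category:", round(bits_reduced_per_category(m), 2), "bits")
```

Two instances per category give 1 bit of reduction and 24 instances give about 4.58 bits, in line with the range of 1 to 4.5 bits per category cited above.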

Posner (1962) has performed a 
study which also allows the produc- 
tion of quantitative relationships be- 
tween amount of information reduc- 
tion and performance where stimulus 
uncertainty is constant and response 
uncertainty varied. In these experi- 
ments the stimulus message was al- 
ways the same, consisting of a series 
of eight numbers selected randomly 
from a population of 1 to 64 and pre- 
sented aurally to S. The S was re- 
quired either to record the numbers or 
to operate upon them by a number of 
information-reducing operations. The 
tasks were chosen so that the output 
information varied from 7.7 to 48 bits 
and so that the components of the oper- 
ations involved were familiar. The 
tasks included an information-conserv- 
ing recording condition, a task of alter- 
nating recording with summing the 
digits of a number, a partial addition 
task in which successive pairs of num- 
bers were added together, a 2-bit classi- 
fication task where the numbers were 
classified into high and odd, low and 
odd, high and even, or low and even, 
and a 1-bit classification task in which 
high and odd or low and even was an
“A” and low and odd and high and
even was a “B.” It was not possible to
compare the tasks directly because errors
were so different for each task.
Therefore, the rate of change of errors
in each task was measured as the interstimulus
interval was reduced from 4
to 1 second. The basic notion was that
the more difficult the task the greater
would be the rate of decline of performance
with speeding. The use of
an error measure, well-learned responses,
and an appropriate correction
for chance levels were designed to reduce
the effect of changes in response
uncertainty per se.


Fig. 4. Percentage decline in performance
with speeding as a function of the amount of
information reduced.

Three separate experiments in- 
volved the following designs: (a) 
each S takes all tasks at one speed, 
(b) each S takes one task at one
speed, (c) each S takes one task at 
two speeds. The results showed that 
in every experiment the rate of de- 
cline in performance increased mono- 
tonically with increasing information 
reduction. The overall relationship, as 
shown in Figure 4, is roughly linear 
with some variation between experi- 
mental designs. In general, the de- 
signs which successfully controlled for 
sequential transfer effects between the 
tasks produced more linear relationships.

These results again demonstrate that
condensation tasks, unlike conservation
tasks, cannot usefully be thought
of as increasing in difficulty with information
transmitted. On the contrary,
when stimulus information is
held constant and S is required to
operate upon it to produce a condensed
output as in this last study,
difficulty declines with increasing information
transmitted. If stimulus information
is increased with response
information constant as in the tasks
measuring reaction time to patterns
(Posner, 1962) or in the classification
of words (Pollack, 1963) then information
transmitted is fixed, but task
difficulty increases. In all these cases,
however, task difficulty is found to be
an increasing linear function of amount
of information reduced.
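
The two cases in this summary can be written compactly. The notation below is an assumption introduced for this sketch: H(S) and H(R) stand for stimulus and response information, T for information transmitted, and I_r for information reduced, with errorless performance on a many-to-one mapping assumed so that T equals H(R).

```latex
\begin{align*}
I_r &= H(S) - H(R), \qquad T = H(R) \\
\text{$H(S)$ fixed:}\quad I_r &= H(S) - T
  \quad\text{(difficulty rises as $T$ falls)} \\
\text{$H(R)$ fixed:}\quad T &= \text{constant}
  \quad\text{(difficulty rises with $H(S)$, hence with $I_r$)}
\end{align*}
```

In both cases the quantity that tracks difficulty is I_r, which is why information transmitted alone gives opposite or null predictions across the two kinds of study.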

One problem with many of the stud- 
ies cited here is that they do not ade- 
quately control the degree of learning. 
In all cases the operations or classifi- 
cations were well known by Ss but 
most studies cannot show that they 
were equally well known or that learn- 
ing effects were unimportant during 
the experimental conditions. It is,
therefore, important to look at the effect
of information-reducing tasks in
a situation where the process of learning
rules is observed. This will be discussed
in the next section.


CLASSIFICATION LEARNING


Unidimensional Rote Learning


Rote learning, like the standard re- 
action time task, requires information 
conservation. The minimum require- 
ment for changing a rote-learning task 
into one of classification learning is to 
have more than one stimulus per response.
When the stimuli lie along a
single dimension the E has minimal
control over the relevant and irrelevant
aspects of the stimulus, but he
can control the number of stimuli to 
be assigned a single response and the 
way in which the stimuli within a re- 
sponse category are grouped along the 
dimension. 

Number of Stimuli per Response. 
One way to vary the amount of infor- 
mation reduction in a classification 
task is to increase the number of 
stimuli per response. Most investiga- 
tors who have attempted this have 
kept the number of stimuli constant 
and reduced the number of responses. 
Hake and Eriksen (1955) and Bricker
(1955b) studied the learning of pat- 
terns of lights when eight, four, and 
two response categories were used. 
Richardson (1958) studied the as- 
signment of 16 adjectives to two, four, 
or eight response categories. In none 
of these studies were significant dif- 
ferences found in the rate of learning 
as a function of the number of stimuli 
per response, once correction had been 
made for differing chance levels. 

There are several problems in study- 
ing the information reduction involved 
in unidimensional rote concepts. First, 
all of the studies cited confound the 
amount of information reduction with 
the number of response terms to be 
learned and used. Despite this, the re- 
sults do indicate that an information- 
reduction task is more difficult to 
learn than the conservation task with 
the same information transmitted, but 
about the same difficulty as a conser- 
vation task with the same stimulus un- 
certainty. Thus, when Es look at the 
classification of a fixed list of stimuli 
into a variable number of responses 
little difference is found among condi- 
tions. This is in agreement with the 
results in many reaction-time studies 
(Crossman, 1953; Fitts & Biederman, 
1964; Morin, Forrin, & Archer, 1961). 
Future studies should run control 
groups with conservation tasks of the
same response uncertainty as the reduc- 
tion tasks as well as at the same stimu- 
lus uncertainty. Second, these studies 
allow no clear distinction between gating 
and condensation. As pointed out 
earlier, Bricker (1955b) used classifi- 
cations which allow the opportunity for 
recoding rules which could eliminate
processing of certain aspects of the 
stimuli. It is well known that verbal 
synonyms (Richardson, 1958) are also 
subject to such mediational rules. How- 
ever, in these experiments it is not 
possible to tell if Ss are using such 
rules or when in the learning they 
come into play. Finally, the most seri- 
ous difficulty with these studies is that 
they do not take into consideration the 
relationship between the stimuli to be 
assigned a single response. That is, 
both the number and the way in which 
stimuli are grouped should affect the
total information reduction involved.

Stimulus Grouping. That the num- 
ber of stimuli per response can be a 
potent factor when there is more con- 
trol over coding within a stimulus 
grouping is shown in a recent study 
(Smith, Jones, & Thomas, 1963). 
This study used eight stimuli varying 
only along the hue dimension. There 
were either two, four, or eight re- 
sponse categories. When stimuli were 
assigned at random to response cate- 
gories the opportunity for recoding by 
use of familiar color names was mini- 
mal. In this situation performance de- 
clined uniformly with increasing infor- 
mation reduction. This is in agree- 
ment with the general relation between 
information reduction and perform- 
ance, even though as information re- 
duction increased the number of re- 
sponses to be learned declined. If ad- 
jacent colors were assigned the same 
response the task becomes one of ab- 
solute discrimination and performance 
increases as fewer categories are required
(Miller, 1956). This study indicates
that both the total information
reduction and the way information is 
distributed within response categories 
are important to predictions of task
difficulty. One can vary the number 
of stimuli per response, but if the 
added stimuli are very similar or if 
they can be coded by a well-learned 
name, little or no change is made in 
task difficulty. 

Some investigators have varied the 
informational properties of the task by 
manipulating the variability of the 
stimuli within a response category. 
Richardson (1958), using meaningful 
adjectives, found that as the similarity 
of items within a response category in- 
creased learning greatly improved; the 
reverse occurred with increases in 
similarity between categories. Shepard 
and Chang (1963), using colors previ- 
ously graded in similarity by use of 
paired comparisons, assigned subsets 
of four stimuli to the same response 
category. They found that the more 
similar the subset assigned to a re- 
sponse category the faster the learning 
and that such differences in within re- 
sponse variability accounted for 78% 
of the learning variance. 

French (1953) systematically studied 
the effect of stimulus similarity within 
and between response categories. He 
varied the similarity between the six 
visual forms assigned to each of two
categories by varying the number of
line segments which were identical.
In the discrimination situation only
half of the set of six similar figures
were in the same response category,
while in the generalization situation
they were all in the same response
category. These results show that performance
is directly related to the degree
of similarity within a response
category and inversely related to similarity
across different categories.


Fig. 5. Rate of transfer as a function of
amount of information reduction.

Posner (1964) has shown that it is 
possible to express the degree of simi- 
larity between patterns in terms of 
uncertainty. Psychophysical studies 
showed a linear relation between the 
amount of uncertainty and perceived 
distance for patterns of eight dots. 
Such patterns were used in a paired- 
associate study in which Ss had to 
transfer the name learned for one pat- 
tern to a new pattern at a given level 
of uncertainty. It was found that the 
degree of uncertainty between patterns 
was linearly related to rate of transfer 
(Fig. 5). 

The result of this study is crucial 
for three reasons. First, it shows that 
rate of learning is a decreasing linear 
function of amount of information 
within a response category. This is 
in agreement with studies of stimulus 
similarity cited above, as one would 
predict from the linear relation be- 
tween uncertainty and perceived simi- 
larity, and with the results of reaction 
time studies cited earlier. Second, it 
allows future studies to combine the 
number of stimuli per response and 
uncertainty between stimuli to obtain 
an overall relation between within-category
uncertainty and rate of learning.
Third, it shows how studies of uni- 
dimensional classification learning can 
be systematically compared with multivariate
concept-learning tasks which
have been explored in information
terms. These relationships will be discussed
further in the next section.


Multivariate Concept Learning 


The studies which we have been 
discussing form a bridge between the 
investigation of rote learning and the 
investigation of traditional concept for- 
mation. They are similar to the con- 
cept-formation design in that a single 
response is made to more than one 
stimulus. However, they lack one attribute
of most concept formation studies
(Garner, 1962); namely, they are
not multivariate in nature. Garner
says,


this multivariate requirement . . . makes it
possible for the experimenter to determine
a priori which variables are relevant to 
changes in the response and which variables 
are irrelevant to these changes. Then the S 
is given the task of determining which vari- 
ables are relevant and once he has learned 
this fact he can respond correctly to a large 
group of stimuli with a single response by 
ignoring stimulus differences which exist 
with respect to the irrelevant stimulus vari- 
ables [pp. 311-312]. 


In these types of studies it becomes 
possible to distinguish rather clearly 
between gating and condensation. 
Metzger (1958) studied paired- 
associate learning of patterns which had 
either three or four distinct elements. 
Each element was a triangle which 
could be either large or small. In the 
three element condition he studied 
conservation tasks with eight and four 
stimuli, a gating task with four re- 
sponse categories (one irrelevant ele- 
ment) and a condensation task with 
random assignment of the eight stimuli 
to four responses. The results showed 
that the gating and condensation tasks 
were not much different from the con- 
servation task with the same stimulus 
uncertainty though much more diffi- 
cult than the conservation task with 
the same response uncertainty. This 
is in agreement with most of the
unidimensional results and agrees with
the reaction time results except with 
respect to the gating task. In the four 
element conditions it is possible to 
make some quantitative comparisons 
of the effects of increasing information 
reduction on rate of learning for both 
gating and condensation operations. 
This is shown in Table 1. The con- 
servation condition is for eight stimuli 
and eight responses, the 1-bit reduc- 
tion point represents tasks with eight 
stimuli and four responses, while the 
2-bit reduction point represents 16 
stimuli and four responses. The re- 
sults show that both types of reduc- 
tion tasks are somewhat more difficult 
than the conservation task with identi- 
cal stimulus uncertainty and that both 
increase in difficulty with increasing 
information reduction. 
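
The reduction points named above can be checked with a small calculation. This is a minimal sketch that assumes equally likely stimuli and equally likely responses, so that each uncertainty is the base-2 logarithm of the number of alternatives; the function name is hypothetical and is not taken from Metzger.

```python
import math


def information_reduced(n_stimuli: int, n_responses: int) -> float:
    """Bits reduced when n_stimuli equiprobable stimuli map onto
    n_responses equiprobable response categories (assumed model)."""
    stimulus_uncertainty = math.log2(n_stimuli)
    response_uncertainty = math.log2(n_responses)
    return stimulus_uncertainty - response_uncertainty


# The three conditions described in the text.
conservation = information_reduced(8, 8)    # eight stimuli, eight responses
one_bit = information_reduced(8, 4)         # eight stimuli, four responses
two_bit = information_reduced(16, 4)        # sixteen stimuli, four responses
print(conservation, one_bit, two_bit)
```

Under these assumptions the three conditions come out at 0, 1, and 2 bits, matching the column labels of Table 1.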

TABLE 1

MEAN TRIALS TO CRITERION
(AFTER METZGER, 1958)

                Information reduced (bits)
Task            0       1       2
Gating          21.2    25      38.4
Condensation    21.2    40.4    51.2

Gating. Information measurement
has been systematically applied to the
investigation of concept identification
with multivariate stimuli (Archer,
Bourne, & Brown, 1955; Bourne &
Haygood, 1959; Walker & Bourne,
1961). These studies are gating tasks
since the Ss have to learn to ignore
certain irrelevant aspects of the stimulus
and then classify all levels of relevant
information into distinct categories.
The results of these studies
show that errors to criterion are an
increasing linear function of the amount
of irrelevant information which must
be ignored in making the response
classification. This is quite consistent
with the results obtained by Metzger
(1959) for paired-associate gating
tasks with constant uncertainty.
These results indicate that the
clear distinction between gating and
condensation tasks is lost in
concept-learning studies. Both are increasing
linear functions of the amount of in-
since the S must learn what aspects of
vant information on sing time is

In the concept-identification experi- 
ments cited above the number of rele- 
vant stimulus levels always equals the 
number of responses. Battig and 
Bourne (1961) have discussed this 
limitation as follows, “one shortcom- 
ing of the concept-identification task, 
and of many other procedures used 
in experiments purportedly concerned 
with concept formation, is that in a 
real sense they do not require con- 
cepts to be formed. This is because 
all of the stimuli to be classified with 
the same response are identical with
respect to the crucial or relevant
characteristics [p. 329].”

Condensation. Battig and Bourne 
(1961) have attempted to remedy this 
defect by varying both the number of 
irrelevant dimensions and the vari- 
ability within a response category. In 
order to do this they compared, for 
example, a situation where a single re- 
sponse was made to right, equilateral, 
and obtuse triangles and another to 
squares, trapezoids, and parallelo- 
grams. In this situation a very sim- 
ple recoding (four sides versus three 
sides) could allow S to ignore some 
of those differences. To determine the 
actual extent of such recoding the 
stimulus patterns should be tried in a 
reaction time task to find out if after 
learning the RT would approach that 
for a one bit conservation task. In 
any case, regardless of the ability of S 
to ignore such differences, the results 
so far reviewed in the learning of 
concepts (Archer, Bourne, & Brown, 
1955; Metzger, 1959) would indicate 
that a linear relation should exist be- 
tween the amount of information re- 
duced in forming the response class 
and rate of learning. Hunt (1962) 
analyzed the Battig and Bourne data 
and found that the relation between 
errors to criterion and information 
within a response category is linear. 

The concept-learning study most 
similar to the condensation tasks dis- 
cussed in section one is that of Shep- 
ard, Hovland, and Jenkins (1962). In 
these experiments there were always 
eight stimulus patterns, consisting of 
two levels on each of three dimensions. 
There were always two response
categories. The problems differed in the
way in which the information relevant 
to the classification was distributed 
across the dimensions. In one prob- 
lem all the information involved only 
one dimension, in another all
information involved two dimensions
simultaneously, a third type had all three
dimensions relevant, and a final type 
involved some of the information in 
one, two, and three dimensional terms. 
The total amount of relevant informa- 
tion within a response category could 
be calculated by weighting the propor- 
tion of information by the number of 
dimensions which had to be attended 
in order for it to be extracted. By this 
method the four types of problems had 
1, 2, 2.3, and 3 bits of relevant infor- 
mation, respectively. Figure 6 shows 
the success of concept learning as a 
function of the amount of information 
condensed in obtaining the binary
classification. The points of this figure are
from Tables 4 and 6 of the original re- 
port and represent averages over dif- 
ferent sets of a given problem type 
and over different display conditions. 
This result indicates that condensation 
is more important than gating when 
they are opposed, as in this task. 
There is some indication that with 
more extensive training than that 
shown in Figure 6 the 2-bit reduction 
tasks begin to become easier than some 
of the 1.3-bit tasks. 

In the learning of concepts the 
amount of information reduced ap- 
pears to be a useful summarizing vari- 
able. As well as having a consistent 
relation to task difficulty it also dem- 
onstrates the close relation between 
unidimensional and multivariate con- 
cept learning both conceptually and 
empirically. This measure and the 
taxonomy based upon type of
information processing should be of
significant value in an integrated attack
upon complex information processing.


THEORETICAL NOTES


A POWER COMPARISON OF THE F AND L TESTS—I


WALTER R. STELLWAGEN


Michigan State University 


The report deals with the behavior of the omnibus F test in a restricted 
hypothesis space and attempts to answer the question “Is it possible 
that a nonparametric test may be considerably more powerful than the 
F test even under conditions of strict normality and equality of vari- 
ance?” An empirical power comparison was made between the F and 
L tests by generating populations having specified ordered relationships 
and equal variances for various combinations of treatments and replica- 
tions. The percentage of rejections of the null hypothesis was plotted 
for both tests for each set of parameters. The power comparison shows 
the nonparametric L test to be more powerful than the omnibus F test 
when there is a monotonic order among the treatment means. In
conclusion, it is suggested that when theory and/or a priori considerations
indicate ordered treatment effects, the experimenter consider the use
of statistical techniques which deal with restricted hypothesis spaces.


The testing of ordered hypotheses is of 
special interest to behavioral scientists 
because many theories and a priori
considerations suggest ordered treatment
effects. In spite of this, few statistical 
tests are available for testing such 
hypotheses. Many tests currently used 
are essentially generalizations of tests for 
evaluating the significance of differences 
between treatment means. 

Probably the only adequate two- 
sample parametric test which is available 
for evaluating ordered hypotheses is the 
one-tailed t test. This test is quite
powerful when the assumptions of nor- 
mality and homogeneity of variance are 
met and reasonably powerful when these 
assumptions are violated as long as the 
variances are equally weighted and the 
distribution is not too extreme. 

There are also a number of two-sample 
one-sided nonparametric tests available 
for testing ordered hypotheses. Among 
these are the binomial test, the sign test, 
the Mann-Whitney U test, and the 
Kolmogorov-Smirnov two-sample test 
(Siegel, 1956). 

It is generally assumed that the
one-tailed t test is more powerful than its
nonparametric counterparts when the
underlying assumptions are met.² It has
been shown, however, that when
sampling is from nonnormal distributions,
the U test tends to be as powerful as the
t test (Boneau, 1962).

¹ Now at the United States Army Personnel
Research Office, Washington, D. C. 20540.

For more than two treatments, the 
problem of testing directional hypotheses 
becomes somewhat more complicated. 
The omnibus F test is generally used in 
such situations since it is considered to be 
the most powerful test available for 
testing for differences between means. 
It should be noted, however, that an 
omnibus test is inappropriate for testing 
ordered hypotheses since it is affected by 
mean differences in any direction and 
will reject $H_0$ with high probability even
when the ordered hypothesis is false.
Either because of the assumption of best
power or because of a certain naiveté
concerning the theory of statistical tests,
which expresses itself most clearly in the
phrase “one- versus two-tailed tests,” a
practice has arisen in the psychological
literature of performing a directional
statistical test only if an omnibus test
has already led to the rejection of the
null hypothesis. This paper will only
concern itself with the problem of power.
It might be well to examine, however,
the issue of tests of directional versus
nondirectional statistical hypotheses.

² That the t test is not uniformly most
powerful at the usual significance levels has
been shown by Lehmann and Stein (1948).


Under the restriction $\sum_{i=1}^{k} \mu_i = 0$, the
hypothesis space of the omnibus F test
is a (k − 1)-dimensional space. The null
hypothesis consists of the origin of the 
space and the alternative hypotheses con- 
sist of the remainder of the space. Be- 
cause the omnibus F test has best 
average power on the surface of Eucli- 
dean spheres about the origin of this 
space, it is probably the most appropriate 
test to use when all possible points in the 
alternative hypothesis space are of equal 
interest. However, it is rarely the case 
that all points are equally of interest; a 
few large differences are usually more 
interesting than many small ones, direc- 
tion is usually of some importance, etc. 
Therefore, various nonomnibus tests 
have been devised. These tests will have 
better power than the omnibus F test in 
certain regions of the hypothesis space 
and lesser power in other regions. 

In psychological literature, arguments
concerning directional tests are most
commonly phrased in terms of
“significance level.” The use of a restricted test
is felt to (spuriously) “increase the
significance” of any given result. In the
case of the t test, the argument seems
almost reasonable. The critical region
for the one-tailed t test contains all the
points in the critical region of the
two-tailed t test for which the sample mean is
in the hypothesized direction. We must
note, however, that there are samples for
which the two-tailed test will reject $H_0$
and the one-tailed will not. In the
general case, the argument loses all its
force. The restricted tests have critical
regions of entirely different shapes from
the omnibus tests. The choice of a test
should be made on the basis of its power
for the alternative of interest. Tests
should be compared only with respect to
the same hypothesis space. Only tests
with equally sized critical regions should
be compared.

The question left then is that of power. 
Is there a test which is more powerful 
than the omnibus F test when the 
hypothesis space is restricted? The 
answer is obviously yes. However, it 
might be of some interest to demonstrate 
that even under conditions of strict 
normality and equality of variance a non- 
parametric test may be considerably 
more powerful than the omnibus F test. 
This is the problem which will be con- 
sidered in this paper. Attention should 
first be directed, however, to statistical 
techniques available for handling k 
treatments. 

The orthogonal polynomial analysis 
for the comparison of curves in the 
analysis of variance is a parametric 
approach to this problem. Grant (1956) 
presents statistical tests for various 
polynomial components of the treatment 
sum of squares and advocates the use of 
such tests when the assumptions of 
independence, normality, and equal in- 
tervals between the levels of the inde- 
pendent variable are met. This regres- 
sion approach allows one to determine 
whether there exists an overall trend. 
Edwards (1960), McNemar (1962), and 
Winer (1962) explain this approach 
further. 

Bartholomew (1961) has developed a 
technique for testing ordered hypotheses 
without requiring the assumption of
equal intervals between the independent 
variables. Bartholomew’s test is su- 
perior to the omnibus F test when a 
priori considerations suggest ordered 
hypotheses and appears to be the best 
multiple treatment parametric test avail- 
able for testing such hypotheses. 

Nonparametric methods which make 
use of the one-way analysis of variance 
approach are given by Armitage (1955) 
for nominal data and by Jonckheere 
(1954a) for ordinal data. 

Probably the most widely used
nonparametric technique for situations
involving more than two treatments is
Friedman’s (1937) two-way analysis of
variance by ranks for related samples.
This chi-square test assumes only an
ordinal scale of measurement and tests
whether the samples could have come 
from the same population. Friedman 
(1940) finds this test to be as powerful 
as the omnibus F test when the assump- 
tions and requirements of the latter 
are met. 

Jonckheere’s (1954b) two-way analysis 
of variance test is based upon Kendall's 
“average tau.” It is the difference
between a predicted theoretical ranking and
a number of judges’ rankings. Another 
ranking statistic is Page’s L test (1963) 
which is specifically designed for testing 
ordered hypotheses in the randomized 
blocks design analysis of variance. It is 
computationally much easier than Jonck- 
heere’s test and is favored over its 
parametric counterpart in that it re- 
quires only an ordinal scale of measure- 
ment. 

The L test is closely related to
Friedman’s $\chi_r^2$, the randomized blocks design
analysis of variance, regression analysis, 
and correlation. Page shows this rela- 
tionship and gives tables associated with 
the use of his test. He suggests use 
of the L test as a replacement for Fried- 
man’s test. A limited power comparison 
made by him shows L to be more power- 
ful than Friedman’s test for testing 
ordered hypotheses. Since Friedman 
found a negligible power difference be- 
tween his test and the omnibus F, we 
may assume that the L test will be 
at least as powerful as the omnibus F 
test. 

The purpose of the present study is to 
show empirically that the nonparametric 
L test is more powerful than the omnibus 
F test when a priori hypotheses suggest 
ordered relationships, even though the 
assumptions of normality and homo- 
scedasticity have been met. A secondary 
problem which will be considered in the 
future is a power comparison of the L 
test with an F test for linear trend. 

As noted above, the most frequent 
practice is to perform the omnibus test 
first and follow with a directional test 
only if the omnibus test leads to a
rejection of $H_0$. We do not maintain that
this is a good practice, but it is almost
universal in application. We note,
therefore, that the power of the method
most commonly in current use cannot ex- 
ceed that of the omnibus F test. A com- 
parison of L with a parametric analog 
may be of considerable interest. If the 
means are ordered and assumptions of 
normality and homoscedasticity are met, 
the question is raised, “How does the 
power of the omnibus F test compare 
with that of L?” If power differences 
are negligible we might expect that L 
might be more powerful under conditions 
of nonnormality and heteroscedasticity. 


METHOD 


The procedure used in the present research 
involved calculating a large number of F and 
L values, each based upon samples which were 
drawn at random from distributions having 
specified characteristics, and constructing the 
power functions of the obtained values. The 
study was performed on Control Data Cor- 
poration 160 and 3600 electronic computers 
programed to perform the following opera- 
tions: 


1. Generation of a uniformly distributed 
random number 

2. Transformation of a sequence of these 
numbers into a random deviate from a normal 
distribution 

3. Successive generation of random devi- 
ates until a sample of certain size is reached 
and transformation of these deviates by 
adding mean values 

4. Calculation of the F and L value based 
on the sample 

5. Calculation of the power of the F and L 
tests on the basis of a series of such samples 


The complete sequence of operations was per- 
formed within the computer. The end result, 
the analysis of variance tables, L values, and 
the power of the respective tests, were printed 
by the computer. 

Amplification of the above operations may 
be relevant and will be made in the order 
just given. 

1. The random number consisted of 8
digits, each being equal to the sum of the
previous two random numbers mod $10^8$. To
start the process it was necessary to place in 
the machine two 8-digit random numbers se- 
lected from a table of random numbers. 

2. In order to obtain a random normal 
deviate, a sequence of 10 uniformly
distributed random numbers obtained in the
above fashion was added³ to produce a single
random deviate having zero mean and unit 
variance. The medians, means, and vari- 
ances of the deviates generated in this way 
were tested by the usual tests and found to be 
within the limits of chance. 

3. The number of replications per treat- 
ment (N) selected in the present investiga- 
tions was 5, 10, and 15. The number of 
treatments (k) was 3, 4, and 5. 

The major problem in computing power in a
situation involving k groups is the selection
of mean values, $\mu_i$. Without loss of generality,
assume $\sum_{i=1}^{k} \mu_i = 0$ and $\sigma = 1$. The
hypothesis space in this experiment is then the origin
plus the smaller Region R of the (k − 1)-dimensional
space bounded by the hyperplanes
$\mu_i = \mu_j$, $i \neq j$. A line passing through the
origin represents the condition that the $\mu_i$ be
in a specified ratio. The distance of a point
from the origin is the noncentrality parameter
$\eta$, utilized in computing the power of the F
test. While the power of the omnibus F test
depends only upon $\eta$, the power of other tests
may depend on the $\mu_i$ themselves. Among
the class of tests which depend only on $\eta$, the
F is known to be uniformly most powerful.
Also, F is a test with maximum average power
on such spheres. However, other tests
probably have better average power on other
surfaces. Short of a point by point
comparison, there appears to be no way to offer a
contrast fair to both tests. It was, therefore,
decided to select arbitrarily a line passing
through the origin, to select points along this
line, and to plot the power against $\eta$. This is,
at least, fair to F since for fixed $\eta$ the power of
F is uniform. The line selected was the line
defined by the points $\mu_1 = 0$; $\mu_{i+1} = A(\mu_i + 1)$.
Points on the line were selected by
multiplying the coordinates of the point (0, 1, 2, . . . , k)
by a scalar A.

4. The formula used for the calculation of
L was the following (Page, 1963):

$$L = \sum_{j=1}^{k} \left( Y_j \sum_{i=1}^{n} X_{ij} \right)$$

where $Y_j$ is the hypothetical ranking of the
jth treatment and $\sum_{i} X_{ij}$ equals the sum
of ranks observed in the jth column. F was
calculated from the appropriate formula for
the randomized blocks design analysis of
variance.

³ A sequence of random deviates ($X_j$) is
given by

$$X_j = \left( \sum_{i=10(j-1)}^{10j-1} R_i - 5.0 \right) \sqrt{1.2},$$

where $R_{i+1} = mR_i \pmod{10^8}$ and m is some
prime number.

⁴ Scheffé (1959, p. 48). In the general
case $\eta$ would be $n \sum (\mu_i - \mu_{\cdot})^2 / \sigma^2$; in the present
case this reduces to $\sum \mu_i^2$, since $\sigma = 1$ and n
and $\mu_{\cdot}$ are fixed.

5. For every combination of number of 
treatments, replications, and mean differences 
between treatment groups, 25 samples were 
generated. The power of each test was given 
by the proportion of times it rejected the 
null hypothesis. 
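
The L statistic of Step 4 can be sketched in a few lines. The following is an illustrative implementation only, not the original computer program: the function names are hypothetical, observations are ranked within each replication (row), tied values receive average ranks (an assumption; the text does not discuss ties), and the predicted ranking is supplied by the caller.

```python
def rank_row(values):
    """Rank the values within one replication, 1 = smallest.
    Tied values share the average of the ranks they occupy."""
    order = sorted(range(len(values)), key=lambda j: values[j])
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        average_rank = (pos + end) / 2 + 1
        for t in range(pos, end + 1):
            ranks[order[t]] = average_rank
        pos = end + 1
    return ranks


def page_l(data, predicted_ranks):
    """L = sum over treatments of (predicted rank * column sum of observed ranks).
    `data` is a list of replications, each a list of k treatment scores."""
    k = len(predicted_ranks)
    column_sums = [0.0] * k
    for row in data:
        for j, r in enumerate(rank_row(row)):
            column_sums[j] += r
    return sum(y * s for y, s in zip(predicted_ranks, column_sums))


# Three replications of three treatments, predicted order 1 < 2 < 3.
scores = [[1.2, 2.5, 3.1],
          [0.9, 3.0, 2.2],
          [1.5, 1.7, 4.0]]
print(page_l(scores, [1, 2, 3]))
```

In this made-up example the column rank sums are 3, 7, and 8, so L = 1(3) + 2(7) + 3(8) = 41, one short of the maximum of 42 that perfect agreement with the predicted order would give.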

To summarize, random samples were drawn 
from populations which were normally dis- 
tributed but had different means. For several 
combinations of treatments and sample sizes, 
F and L values were calculated and the power
of each test computed as a function of the 
percentage of rejections of the null hypothesis. 
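
The whole procedure can be sketched as a small Monte Carlo program. This is a hedged illustration under several assumptions that depart from the original: Python's `random` module replaces the additive generator, critical values are estimated by simulating the null case rather than read from F tables or Page's tables, and the function names, seed, and number of samples (`reps`) are arbitrary choices. Normal deviates are formed, as in Step 2, from the sum of 10 uniform numbers.

```python
import math
import random


def normal_deviate(rng):
    """Approximate N(0, 1) deviate: sum of 10 uniforms, centered and rescaled."""
    return (sum(rng.random() for _ in range(10)) - 5.0) * math.sqrt(1.2)


def sample(rng, means, n):
    """n replications (rows) of len(means) treatments with unit variance."""
    return [[mu + normal_deviate(rng) for mu in means] for _ in range(n)]


def page_l(data):
    """Page's L for the predicted order treatment 1 < 2 < ... < k (ties ignored)."""
    k = len(data[0])
    column_sums = [0] * k
    for row in data:
        order = sorted(range(k), key=row.__getitem__)
        for rank, j in enumerate(order, start=1):
            column_sums[j] += rank
    return sum((j + 1) * column_sums[j] for j in range(k))


def blocks_f(data):
    """Omnibus F for treatments in a randomized blocks analysis of variance."""
    n, k = len(data), len(data[0])
    grand = sum(map(sum, data)) / (n * k)
    col_means = [sum(row[j] for row in data) / n for j in range(k)]
    row_means = [sum(row) / k for row in data]
    ss_treat = n * sum((m - grand) ** 2 for m in col_means)
    ss_block = k * sum((m - grand) ** 2 for m in row_means)
    ss_total = sum((x - grand) ** 2 for row in data for x in row)
    ss_error = ss_total - ss_treat - ss_block
    return (ss_treat / (k - 1)) / (ss_error / ((k - 1) * (n - 1)))


def estimate_power(means, n, alpha=0.05, reps=2000, seed=1):
    """Return (power of F, power of L) as proportions of rejections."""
    rng = random.Random(seed)
    k = len(means)
    null_f, null_l = [], []
    for _ in range(reps):                      # simulate the null case
        d = sample(rng, [0.0] * k, n)
        null_f.append(blocks_f(d))
        null_l.append(page_l(d))
    null_f.sort()
    null_l.sort()
    idx = min(int((1 - alpha) * reps), reps - 1)
    crit_f, crit_l = null_f[idx], null_l[idx]  # empirical critical values
    reject_f = reject_l = 0
    for _ in range(reps):                      # simulate the ordered alternative
        d = sample(rng, means, n)
        reject_f += blocks_f(d) > crit_f
        reject_l += page_l(d) > crit_l
    return reject_f / reps, reject_l / reps


power_f, power_l = estimate_power([0.0, 0.5, 1.0], n=10)
print(power_f, power_l)
```

Because the critical values here are themselves simulated, the estimates carry more sampling noise than a comparison based on tabled values; the sketch shows the logic of the comparison rather than reproducing the reported curves.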


RESULTS 


Samples were generated for the various 
combinations of treatments and sample 
sizes. The percentage of times that the 
null hypothesis was rejected by F and 
by L was calculated for each combina- 
tion at three significance levels. Run- 
ning averages of these power functions 
were plotted against the logarithm (4 
cycles by 10 to the inch) of the non- 
centrality. The difference between the 
L and F curves for a particular alpha 
level empirically shows the power differ- 
ence between the two tests. For an 
explanation of power and noncentrality, 
see Scheffé (1959, pp. 38-39).

Looking at all curves we note the 
power differences are greater in the 
middle than at the ends. This is to be 
expected, however, since all power curves 
must be at height alpha at $H_0$ and most
curves are at height 1.0 as noncentrality
increases to large values.⁵

The first case to be considered is that
of three treatments. Figure 1 shows the
power curves associated with five
replications per treatment. L is shown to be
considerably more powerful than F⁶ at
the .05 and .01 levels. This is especially
true for noncentrality values between .50
and 4.50. At the .001 level L was found
to be consistently less powerful than F,
especially between noncentrality values
of 2.00 and 24.5.

When the number of replications per
treatment is 10, there is a very small
power difference in favor of the L test
at all alpha levels. The greatest power
differences in favor of L were found at
the .05 and .01 levels with small
noncentrality values. The above tends to
indicate that when a priori considerations
suggest small monotonic differences
between three treatment effects, L would
be preferred over F at the .05 and .01
levels. There is no apparent power
difference between the L and F tests at
the .001 level with 10 replications.

When the number of replications
equals 15 (see Fig. 2), L tends to be
consistently more powerful than F at the
.05 level. This is particularly evident for
noncentrality values between .03 and 2.0.
The power differences at the .01 and .001
levels also favor L. However, very little
power can be gained by the use of L at
these levels.

⁵ Zero noncentrality was not run since
log 0 = −∞.
⁶ Unless otherwise indicated hereafter “F”
refers to the omnibus F test.

Fig. 1. Power curves for the F and L tests plotted against $\eta$ when k = 3 and N = 5.

Fig. 2. Power curves for the F and L tests plotted against $\eta$ when k = 3 and N = 15.

Fig. 3. Power curves for the F and L tests plotted against $\eta$ when k = 4 and N = 5.

These results indicate that L will be 
considerably more powerful than F when 
the number of replications is small for 
noncentrality values greater than 125. 
The power difference below this point 
appears negligible. When the number 
of replications increases both tests
become more powerful for detecting small
noncentrality values with L remaining
more powerful than F. This is consistent
with the well-known rule that if a priori
considerations indicate small differences
between treatment means it would be
wise to increase sample size somewhat in
order to detect these differences.

Fig. 4. Power curves for the F and L tests plotted against $\eta$ when k = 4 and N = 15.

When the number of treatments was
increased to four, the power difference
favoring L became more noticeable.
Figure 3 (k = 4, N = 5) shows a large
power difference between noncentrality
values of .078 and 5.0 with the
magnitude of the difference decreasing as $\eta$
increases since, of course, both power
curves must be asymptotic to 1.0.
When noncentrality falls below .078 the
magnitude of the power difference
between F and L decreases; however, L is
still the more powerful. Figure 4
(k = 4, N = 15) shows power curves
similar to Figure 3, the only exception
being that L shows a greater power
difference at the .05 and .01 levels for
small mean differences. At the .001
level, when noncentrality was equal to
or greater than .078, the power difference
favoring L was much greater than in the
previous case. This is consistent with
the findings for three treatments.


Fig. 5. Power curves for the F and L tests plotted against $\eta$ when k = 5 and N = 5.


In summary, for four treatments L is
more powerful than F in all cases with
the maximum power differences occurring
at low and intermediate values of
noncentrality. For large values of
noncentrality both tests are extremely powerful
with L remaining slightly more powerful
than F. An increase in sample size is
usually accompanied by a general
decrease in the magnitude of the power
difference, the only exception being
when the mean differences are small.
In this case, an increase in the number
of replications increases the magnitude
of the power difference between L and F.

For five treatments L is more powerful
than F for all sample sizes, levels of
significance, and noncentrality values. The
only point at which the two appeared
equally powerful was at the .001 level
when noncentrality is 5.60 and greater.

Figures 5 and 6 show power curves
consistent with previous figures. When
the number of treatments is 5 and the
number of replications is 5, 10, and 15,
large power differences favoring L were
found at the .05 and .01 significance
levels. This power difference was
especially noticeable at the .05 level for
small noncentrality values. Large power
differences were not found, however, for
small mean differences.

Fig. 6. Power curves for the F and L tests plotted against $\eta$ when k = 5 and N = 10.

In conclusion, if hypotheses sug- 
gest large differences between treatment 
means, increasing sample size will in- 
crease the power of both tests with L 
remaining about 10 to 15% more 
powerful. 


DISCUSSION


This research gives empirical support 
to the prediction that the L test is more 
powerful for testing ordered hypotheses 
than the omnibus F test for the random- 
ized blocks design. This is empirically 
visible in the preceding figures. Gener- 
ally, as the number of treatments in- 
creases, the power difference between L 
and F increases in favor of L with the 
magnitude of the power difference be- 
tween F and L decreasing as the number 
of replications increases. As the number 
of replications and treatments increases, 
the power curves also tend to move 
slightly to the left, with L becoming 
more powerful than F for very small 
mean differences. 

In general, the greatest magnitude of
power difference between the L and F
tests appears to be at the .05 and .01
levels. This is most noticeable when the
number of replications is small. There
is an increase in power for both tests as
noncentrality increases with L being
consistently more powerful.

The only case where F was found to be 
more powerful than L was when the 
number of treatments was three and 
when the number of replications was 
five at the .001 level of significance. 
This may be attributable to sampling
error, but the consistency of this reversal
calls such a hypothesis into question.


A COGNITIVE INTERPRETATION OF INTENSITY
EFFECTS IN STIMULUS GENERALIZATION 


FRANK RESTLE 


Indiana University 


A subject trained to a dim light responds well to a stronger light, but
if trained to the stronger light he responds weakly to the dim. Hull
calls this stimulus-intensity dynamism (V). If stimuli are represented
as sets and a stronger stimulus as a larger set, then stimulus-sampling
theory predicts an effect opposite to that usually observed. If learning
amounts to selection of a strategy, as in cognitive theories, and if
stronger stimuli lead to larger sets of adequate strategies, then
weak-to-strong generalization is greater than strong-to-weak. This note
formalizes and generalizes the cognitive theory, using a set-theoretic
model.


Consider two stimuli of different
intensity, letting $S_2$ be the more intense
and $S_1$ the less intense. In an
experiment, Group $G_{12}$ is first trained to
respond to Stimulus $S_1$, then is tested on
$S_2$. Another group, Group $G_{21}$, is first
trained to respond to $S_2$ and is then
tested on $S_1$. It is repeatedly found in
such experiments that the probability of
response to Stimulus $S_2$ by Group $G_{12}$ is
higher than the probability of response
to $S_1$ by Group $G_{21}$ (Mednick &
Freedman, 1960).


Hullian Theory 


This finding is discussed as
Stimulus-Intensity Dynamism (V) by Hull (1949),
who postulated that response strength is
influenced by the intensity of the test
stimulus. Since $S_2$ is more intense than
$S_1$, it provides more dynamism. Since
Group $G_{12}$ is tested on the more
dynamic stimulus, it responds strongly.
Since Group $G_{21}$ is tested on the weak
stimulus, it responds weakly. True
underlying generalization is assumed to be
symmetrical.

The postulate of Stimulus-Intensity
Dynamism has little independent
support, and does not deduce the
phenomenon from more elementary
principles. A more fundamental analysis is
difficult to obtain from Hullian theory,
for in that theory there is no natural
representation of stimulus intensity, as
distinguished from qualitative variations
of stimuli.


Stimulus-Sampling Theory 


If stimulus situations are represented 
as sets, the intensity of a stimulus may be 
represented by the number or measure of 
elements in it. Recent discussions have
shown that the stimuli of a dimension can
each be represented as a set, and the sets
can be arranged in an orderly way
(Restle, 1959, 1961a).

One theory of generalization (Bush & 
Mosteller, 1951) employs sets in the 
framework of stimulus-sampling theory. 
Let S and S′ be two stimuli to which the
subject initially does not respond.
Suppose the subject is then trained to
respond to Stimulus S with probability
p. From various special assumptions it
is deduced that the probability of a
generalized response to S′ is p′, given by

$$p' = \frac{m(S \cap S')}{m(S')}\, p,$$

where m is a suitable measure function.
Suppose that the training probabilities
are equal for Groups $G_{12}$ and $G_{21}$. By
definition the intersections will be
$m(S_1 \cap S_2)$ and $m(S_2 \cap S_1)$, which are
equal. Hence, the probability of
generalized response will be inversely
proportional to m(S′), the measure of the
test stimulus.


This means more generalization to S1
in Group G21, and less generalization to
S2 in Group G12, the opposite of Mednick
and Freedman's (1960) conclusion.

There are several possible retreats for
the set-theory model. It could be argued
that the experiments in question do not
satisfy the important assumptions of
“equal proportions” assumed by Bush
and Mosteller (1951). However, systematic
employment of the stimulus-sampling
theory (Estes & Burke, 1953)
seems to insure that the average of a
group of subjects would satisfy the equal
proportions assumptions, since training
will have progressed by random sampling
of stimulus elements from the Training
Situation S. Another argument would
be that it is naive to represent the more
intense stimulus by a larger set. It can
be argued that incidental effects could
have been introduced into the experiments,
or that the Training Level p was
not controlled, or that the empirical
results are not extremely impressive
when looked at in detail.


Selection of Strategies: One at a Time


The writer (Restle, 1961b, 1962) has 
used ideas from older cognitive learning 
theories in a “selection of strategies” 
theory. Suppose that the subject ac- 
quires the original response by selecting 
some one strategy; that is, by selecting a 
particular aspect of the situation and an 
appropriate response. If the response 
has a satisfactory outcome, it is assumed 
that the subject continues to use exactly 
the same strategy. 

Now imagine that the training situation
presents the subject with a Set H
of strategies of which a Subset R leads to
the response. Let m(H) and m(R) be the
measures of these sets. If a subject
samples a single strategy from H, the
probability it is in R is m(R)/m(H).

In training the subject samples randomly
from H. If his strategy leads to
no response (is in R̄, the complement of R)
then the experimenter, by nonreinforcement
or other device, induces the subject to resample.
If at some point in training an element
of R is chosen the experimenter rewards
the response or otherwise induces the
subject to keep that strategy. Let P(R)
be the probability that training has
finally led the subject to select a strategy
from R.

The test situation makes available a
Set H’ of strategies of which the Subset
R’ leads to response. From elementary
probability theory,

$$P(R') = P(R' \mid R)\,P(R) + P(R' \mid \bar{R})\,P(\bar{R}). \qquad [2]$$

Assume that P(R’|R̄) is negligible. This
idealizes the experiment by assuming
that all responses to the test are generalized
responses. Then,


P(R’) = P(R’|R)P(R). 


If the subject has a strategy in R he
chose it in a single random sample.
Therefore no subset of R is favored over
any other, and,

$$P(R' \mid R) = \frac{m(R \cap R')}{m(R)},$$

whence,

$$P(R') = P(R)\,\frac{m(R \cap R')}{m(R)},$$

or in notation like that of Equation 1,

$$p' = p\,\frac{m(R \cap R')}{m(R)}. \qquad [3]$$

Equation 3 is similar to Equation 1. The
differences are that Equation 3 refers not
to the whole set but only to strategies
which lead to response, and Equation 3
has, as the denominator of the fraction,
the measure of the training situation, not
of the test situation.

By Equation 3, generalization is inversely
proportional to the measure of
the training set. This means more
generalization to S2 in Group G12 and
less generalization to S1 in Group G21.
This is in agreement with the conclusions
of Mednick and Freedman (1960).
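The contrast between Equation 1 and Equation 3 can be illustrated numerically. The following is only a sketch under stated assumptions: the measures (4 for the weak stimulus, 10 for the strong one, with the weak set wholly contained in the strong) and the training probability of .9 are hypothetical values chosen for illustration, and the function names are not from the original note. It is further assumed, for the selection-of-strategies case, that the measure of the response-leading subset grows with stimulus intensity.

```python
def eq1_stimulus_sampling(p, m_common, m_test):
    """Equation 1: p' = p * m(S ∩ S') / m(S'); denominator is the TEST set."""
    return p * m_common / m_test


def eq3_selection_of_strategies(p, m_common, m_train):
    """Equation 3: p' = p * m(R ∩ R') / m(R); denominator is the TRAINING set."""
    return p * m_common / m_train


# Hypothetical measures (assumptions, not data from the note).
m_weak, m_strong = 4.0, 10.0   # weak stimulus set is smaller
m_common = 4.0                 # weak set assumed nested in the strong set
p = 0.9                        # equal training level in both groups

# Group G12: trained on weak (S1), tested on strong (S2).
# Group G21: trained on strong (S2), tested on weak (S1).
eq1_g12 = eq1_stimulus_sampling(p, m_common, m_test=m_strong)
eq1_g21 = eq1_stimulus_sampling(p, m_common, m_test=m_weak)
eq3_g12 = eq3_selection_of_strategies(p, m_common, m_train=m_weak)
eq3_g21 = eq3_selection_of_strategies(p, m_common, m_train=m_strong)

print("Equation 1:", round(eq1_g12, 3), "vs", round(eq1_g21, 3))
print("Equation 3:", round(eq3_g12, 3), "vs", round(eq3_g21, 3))
```

Under these assumed values Equation 1 gives the weak-to-strong group the smaller probability, while Equation 3 gives it the larger one, which is the direction reported by Mednick and Freedman (1960).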


Selection of Strategies: Random Sample 


The results above depend upon the
assumption that the subject uses exactly
one strategy in R, but even if he should
use more than one the same general kind
of result follows. Suppose, for example,
that the subject uses n strategies in R,
and will generalize to R’ if as many as one
of these hypotheses appears in R’. Then
the probability of generalization will depend
upon the number of elements in R,
R’, and R ∩ R’. Let the number of
elements in R be r, the number in R’ be
r’, and the number in the intersection
R ∩ R’ be i. Then assuming all elements
have the same probability, the probability
of generalizing is:

$$p' = 1 - \frac{\binom{r-i}{n}}{\binom{r}{n}} = 1 - \frac{(r-i)!\,(r-n)!}{(r-i-n)!\,r!}.$$

For fixed n and i, p’ decreases with r.
To prove this, call p’ a function of r, and
note that [1 − p’(r + 1)]/[1 − p’(r)] is
greater than 1.
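The monotonicity claim can be checked numerically. The sketch below assumes the reading that the subject holds n strategies drawn without replacement from the r elements of R and generalizes if at least one of them falls in the intersection of i elements; the function name and the particular values of r, i, and n are illustrative assumptions.

```python
from math import comb


def p_generalize(r, i, n):
    """Probability that at least one of n strategies drawn without
    replacement from r elements lies among the i shared elements:
    1 - C(r - i, n) / C(r, n)."""
    if not (0 <= i <= r and 0 <= n <= r):
        raise ValueError("require 0 <= i <= r and 0 <= n <= r")
    return 1 - comb(r - i, n) / comb(r, n)


# Hold n and i fixed and let the training set R grow.
n, i = 2, 3
values = [p_generalize(r, i, n) for r in range(5, 30)]
for r, value in zip(range(5, 30, 6), values[::6]):
    print(f"r = {r:2d}  p' = {value:.4f}")
```

For these assumed values the probability falls steadily as r increases, in line with the statement that generalization decreases with the size of the training set.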


Conclusions 


The analysis given from cognitive
theory has two advantages. Unlike
Hullian theory, it has a natural and intrinsic
way of representing intensity of
stimuli, and predicts asymmetric generalization
without any special postulate.
Unlike the stimulus-sampling theory,
which it otherwise resembles, the cognitive
theory predicts the asymmetry in the
direction observed in experiments.


ACTUARIAL METHODS AS APPROPRIATE STRATEGY
FOR THE VALIDATION OF DIAGNOSTIC TESTS *


JACOB O. SINES 


University of Missouri School of Medicine 


The equivocal results of research conducted to validate the clinical use
of psychological tests are attributed to inappropriate design or
strategy. The point is made that the prediction of behavior on the
basis of test performance in the clinic does not necessarily imply the
predictions which are examined in many validation studies. It is
suggested that appropriate validation studies involve: (a) the evaluation
of the hypotheses made by the clinician, (b) about test patterns
rather than isolated aspects of the entire test responses. Such an
approach seems to epitomize the use of actuarial techniques. Data
are provided to illustrate several of the critical considerations.


The published results of studies of the 
predictive and concurrent validity of most 
psychological tests do not seem to have 
seriously curtailed the clinical use of such 
techniques as the Minnesota Multiphasic 
Personality Inventory (MMPI), Ror- 
schach, Thematic Apperception Test, or 
even in some quarters the Draw-a-Person 
Test (DAP). Many clinicians know 
such tests are useful while the skeptics 
know equally well that the conventional 
methods which have been used to assess 
the validity of these test instruments do 
not yield evidence to support this wide- 
spread faith. 

This seeming inconsistency may be resolved
in part if one closely considers
the difference between the manner in 
which the clinician uses tests and the 
strategies most often used to study the 
validity of psychological tests. Brief at- 
tention to this difference may lead to a 
possible explanation of the equivocal re- 
sults of predictive and concurrent valida- 
tion studies and may at the same time 
suggest an alternative and possibly more 
productive validation procedure. 

As the clinician uses the Rorschach or
the MMPI he infers that persons who
perceive the blots in a specific fashion or
who generate a particular type of profile
show certain characteristic behavior or
dynamics. If one lets P1 stand for a
particular test pattern and B1 refer to a
specified pattern of real life behavior, the
clinician's activity when describing personality
on the basis of psychological test
results may be represented logically as
(if P1, then B1). As discussed by Hoffman
(1960) this test-pattern factor (P1)
in the clinical judgment process seems to
involve a number of test variables X1, X2,
..., Xk, each of which may be assigned
selective weights, or the interaction of
which may be considered by the clinician.
Although not the main consideration at
this time, it should be kept in mind that
P1, the test pattern, may, and usually does,
consist of several variables rather than
just one variable or score. This point
will be returned to below.

1 The work reported here was supported
in part by United States Public Health
Service Grant MH 6097.

The skilled clinician bases his infer- 
ences on his past experience with persons 
who have responded to the test in more or 
less similar ways. The fact cannot be 
overemphasized that the clinician does 
not often or successfully predict a test 
pattern from a diagnosis, or even from 
several bits of socially relevant real-life 
behavior. It is indeed rare to find a
clinician who can predict in great detail
a patient's MMPI profile or Rorschach
psychogram on the basis of brief contact
with him. The frequency with which one
encounters psychologists who can describe
in great detail the behavior,
thought pattern, or dynamics of some
persons from only an MMPI profile or a
Rorschach protocol is not unimpressive,
however, and hopefully disturbing to the
critics of psychological testing.

Unfortunately for the validity box 
score, many studies of the validity of 
psychological tests are not directly rele- 
vant to the question they were designed 
to consider. In most validation studies, 
a criterion group of persons is cus- 
tomarily tested and essentially the follow- 
ing prediction is put to test: “if a person 
shows certain real-life behavior then he 
will also show certain test behavior.” 
Stated in the terms mentioned above, the
logical form of this statement is “if B1,
then P1.” Unfortunately this is quite a
different matter from studying the valid- 
ity claimed for the test by the clinician. 
Quite simply, to paraphrase Ainsworth 
(1954), the study of the validity of a 
test should involve an evaluation of the 
predictions made by the clinician when he 
uses that test. To repeat, the clinician’s 
prediction is: “if a person shows a par- 
ticular pattern of test behavior then he 
will show certain other relevant behav- 
ioral characteristics.” Validity studies 
in which a discrete aspect of real-life 
behavior (suicide or response to therapy) 
is employed as the criterion variable, 
while regularity in a pattern of test be- 
havior is viewed as the dependent vari- 
able may thus not only be irrelevant 
to the validity of psychological tests as 
they are used, but they seem to imply a 
widespread but unjustified notion of the 
logic of predictive relationships and a 
disregard for the possibility that multiple
causes may be operating to produce a
given behavior pattern. 

The turmoil concerning the actuarial 
use of tests seems to have obscured the 
implications of one point stated by both 
Meehl (1954) and Holt (1958) which 
implies an alternative approach to the 
validation of psychological tests. In es- 
sence the actuarial approach as described 
by Meehl and the sophisticated clinical 
use of tests advocated by Holt, both in- 
volve the determination of the character- 
istic behavior, thinking pattern, or dy- 
namics found among persons who respond 
to tests in certain common ways, that is,
whose patterns of test performance are
similar.

It must be admitted that the relationships
between test variables and personality
characteristics which Meehl might
wish to formalize and which Holt considers
important for the sophisticated
clinician are not often clearly specified in
the clinician’s apperceptive mass, nor are
they generally reflected in studies of the
validity of psychological tests.

If one considers the assumed relationship
between a socially significant behavior
pattern, B1, that one wishes to
predict and various personality characteristics
which may be reflected in Test
Patterns P1, P2, or P3, the above point
may become clearer. On the basis of
training and experience the clinician may
have found on many occasions that Behavior
Pattern B1 is observed in persons
who show Test Pattern P1. This is not
a logically symmetrical statement in that
it in no way implies that Behavior Pattern
B1 may not also be found in persons
who generate Test Patterns P2 or P3, or
otherwise stated, it does not imply that
every person who shows Behavior Pattern
B1 will also show Test Pattern P1.

The usual study of predictive or concurrent
validity is designed as if the
clinician’s behavior were “P1 ≡ B1,” that
is, “Behavior Pattern B1 implies Test Pattern
P1 and Test Pattern P1 implies Behavior
Pattern B1.” Test Patterns P2
and P3 are too often not considered.

Thus, many of the studies of predictive
or concurrent validity which offer only
equivocal support for this latter hypothesis,
“if Behavior Pattern B1 then Test
Pattern P1,” should at least raise the
possibility that several personality patterns
exist which may be reflected in Test
Patterns P2, P3, or P4 as well as P1, each
of which may be predictively related to
the behavior pattern we are interested in.

This more realistic conception of predictive
relationships would lead to the
hypothesis that “if P1 (or P2 or P3) then
B1”; and this may in fact be more representative
of the way the clinician functions
in his actual work.

Consider the possibility that a given
behavior pattern (B1) may be predicted
from any of five test patterns. For the
sake of illustration let us assume that of
100 persons who manifest Behavior Pattern
B1, 40 generate Test Pattern P1,
and 15 generate each of the Test Patterns
P2, P3, P4, and P5. The chances are
good that the clinician’s experience will
lead him to state “if P1 then B1.” If he
should attempt to validate his conclusion
by determining the average test pattern
of a sample of persons who show Behavior
Pattern B1, he will be unsuccessful
to the extent that he has a representative
sample of persons who show Behavior
Pattern B1. His initial clinical observation
(if P1 then B1) may have a very
high degree of validity yet Test Pattern
P1 may account for less than half of the
variance he finds in his attempt to validate
that statement.
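The asymmetry in this illustration can be put in terms of two conditional proportions. In the sketch below the counts for persons showing the behavior pattern are those given in the text; the total number of persons who generate the first test pattern (50) is a hypothetical figure added only so that the forward proportion can be computed, and the variable names are illustrative.

```python
# Counts from the illustration: of 100 persons showing behavior pattern B1,
# 40 generate test pattern P1 and 15 generate each of P2 through P5.
b1_by_pattern = {"P1": 40, "P2": 15, "P3": 15, "P4": 15, "P5": 15}

# Hypothetical assumption (not in the text): 50 persons in all generate P1,
# so 10 of them do not show B1.
p1_total = 50

n_b1 = sum(b1_by_pattern.values())

# "If P1 then B1": proportion of P1 cases that show the behavior.
p_b1_given_p1 = b1_by_pattern["P1"] / p1_total

# "If B1 then P1": proportion of behavior cases that show P1.
p_p1_given_b1 = b1_by_pattern["P1"] / n_b1

print(f"P(B1 | P1) = {p_b1_given_p1:.2f}")
print(f"P(P1 | B1) = {p_p1_given_b1:.2f}")
```

With these figures the clinician's statement holds for most P1 cases, yet a study that starts from the behavior group would find P1 in fewer than half of its members.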

Gilberstadt (1962) has illustrated this 
point in a recent report in which he con- 
trasts mean and modal MMPI profiles in 
a group of neurodermatitis patients. 

It was noted above that there may be
several variables and possibly their interaction
which constitute the test-pattern
factor in clinical judgment. This seems
to be the essence of the frequently repeated
caution that the diagnostic or predictive
significance of a Rorschach F+%
or an MMPI Sc score depends
upon the entire test pattern in which it
occurs.

Consider the implications if Behavior
Pattern B1 is in fact found in persons
who generate some Test Pattern P1.
Test Pattern P1 consists of several variables,
X1, X2, ..., Xk, each of which is a
test representative of a personality factor
which individually bears some predictive
relationship to B1, but no one of which
alone explains the major portion of the
variance in B1.

We may study the predictive value of
any one variable, Xj, in isolation if we
wish, but logically we can expect a positive
response to an inquiry in the form
of “if Xj then B1” in direct proportion to
the weight of Xj in comparison to the
summed weight of X1, X2, ..., Xk, or
possibly some interaction of X1, X2, ...,
Xk. Thus, even some studies of predictive
or concurrent validity which appear
to approximate the “if P1 then B1” form
are actually tests of the hypothesis “if
Xj then B1.”

A more reasonable and appropriate test
of the predictive or concurrent validity of
the Rorschach or the MMPI would seem
to be “if personality (test) pattern P1
then Behavior Pattern B1” (if P1 then
B1). And, furthermore, “if P2 then B1
and if P3 then B1.” A meaningful test of
an instrument’s predictive or concurrent 
validity in these terms would necessarily 
involve the study of the real-life behavior 
of groups of persons whose test behavior 
is similar, rather than the study of the 
test behavior of diagnostic groups or 
groups of persons who manifest some 
limited amount of common real-life be- 
havior. If persons whose test protocols 
or test patterns are highly similar also 
show highly similar social behavior, 
thought patterns, or dynamics, then the
concurrent validity of a specific test pat- 
tern would be supported. 

In order to test the above notions 
adequately, there appear to be at least 
three methodological considerations which 
must be clearly recognized, the last two 
of which were referred to by Holt (1958)
some years ago. 

First, and of critical importance, it
must be kept in mind that the predictive
relationship between a test pattern and
a behavior pattern is asymmetrical. The
statement “if B1 then P1” does not
logically follow from the statement “if P1
then B1,” the latter being the logical
statement describing the clinician’s use
of his psychometric instruments.

Second, it is essential that we have a 
reliable method for the recognition of a 
test pattern. This requirement is most 
efficiently fulfilled at the present time if 
the test data are reducible to numerical 
form, as, for example, in the MMPI or 
the Holtzman Ink Blots. Several meth- 
ods for the recognition of patterns or the 
calculation of indexes of profile similarity 
have been proposed (Meehl, 1959; Meehl 
& Dahlstrom, 1960; Mosel & Roberts, 
1954). Most methods have been soundly 
criticized for their obvious deficiencies 
and while each has some degree of utility, 
no general or widespread application of 
any one of these methods to psychometric
pattern recognition has appeared in the
psychological literature to date. Suffice 
it to say that the investigator may 
choose from several methods, but what- 
ever his choice, its weaknesses will be 
pointed out very quickly by his colleagues. 
In the study reported below the gener- 
alized distance function as described by 
Cronbach and Gleser (1953) was used to 
assess similarity of MMPI profiles. 

Third and finally, one must determine 
the socially and psychologically relevant 
behavioral characteristics of the patients 
who generate a specific test pattern. It is 
on this point, too, that major disagree- 
ment may occur. The type of information 
one has about the persons already tested
and the information one wishes to obtain
by testing in the future will determine the
source and type of personality data to be
dealt with. There are those who rely
primarily on various Q sorts (Marks &
Seeman, 1963), terms used in psychiatric
summaries (Gilberstadt & Duker, 1960),
or statements derived from extensive
clinical familiarity and previous research
with a given test (Kleinmuntz, 1963;
Pearson, 1962). The several sources of
information are not mutually exclusive
and probably can be combined to
provide the most comprehensive coverage.
For a variety of reasons, the behavioral
characteristics dealt with in the study reported
below were drawn from the medical
folders of psychiatric patients.


Illustrative Data 


A group of 25 highly similar profiles
was identified in a sample of 500 male
MMPI profiles obtained at the time of
admission to a state hospital.2 Each profile
in the sample was compared to every
other profile and an index of profile similarity,
the generalized distance function,
D, was calculated for each pair of profiles
(Cronbach & Gleser, 1953). The mean
profile of this first cluster and the limits
of ±1 SD on each scale may be seen in
Figure 1. A profile can be said to belong
in this cluster (4’3 with L and K elevated)
if it compares to the mean profile
with a D of 22.00 or less ($D = \sqrt{\sum d^2}$,
where d is the difference in T score points
between corresponding scales of two
profiles).

[Fig. 1. Mean profile for patients in first computer-identified cluster (N = 25).]

[Fig. 2. Mean profile for control group (N = 20).]
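The distance index and the cluster rule described above can be sketched in a few lines. This is only an illustration: the function names are assumptions, the three-scale profiles are hypothetical T scores rather than data from the study, and only the cutoff of 22.00 comes from the text.

```python
from math import sqrt


def profile_distance(profile_a, profile_b):
    """Generalized distance D: square root of the summed squared
    differences in T score points between corresponding scales."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must cover the same scales")
    return sqrt(sum((a - b) ** 2 for a, b in zip(profile_a, profile_b)))


def in_cluster(profile, mean_profile, cutoff=22.0):
    """A profile belongs to the cluster if its D from the cluster's
    mean profile is at or below the cutoff (22.00 in the text)."""
    return profile_distance(profile, mean_profile) <= cutoff


# Hypothetical T scores on three scales, for illustration only.
mean_profile = [50, 60, 70]
candidate = [53, 64, 70]
outlier = [80, 40, 45]

print(profile_distance(candidate, mean_profile), in_cluster(candidate, mean_profile))
print(round(profile_distance(outlier, mean_profile), 2), in_cluster(outlier, mean_profile))
```

Because D sums squared differences over every scale, a profile can fall outside the cluster through one large discrepancy or through many small ones.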

A control group of 20 profiles was 
selected from the same initial sample of 
500 MMPI profiles. The control profiles 
were chosen if they did not compare with 
more than two other profiles with a D of 
25.00 or less. The mean profile and the
limits of ±1 SD on each scale can be
seen in Figure 2.

The hospital chart for each patient
whose MMPI profile occurred in either
the 4’3 group or the control group was
then abstracted in detail. Each descriptive
word, term, or phrase was recorded
and sorted into one of 78 empirically derived
categories. In the following discussion
only two of these categories, Personality
Behavior and Personality Intrapsychic,
will be considered.

2 These data were obtained through the
generous assistance of Howard R. Davis
and William Leipold, Minnesota Department
of Public Welfare.

3 Other categories used in sorting descriptive
statements in which relevant material
may occur include: Affect, Cognitive Functioning,
Interview Behavior, Mental Status,
Relations with Parents, Social Behavior, as
well as others to be discussed in detail in a
forthcoming report.

The number of hospital records on
patients in each of the above two groups
in which a particular statement occurred
was tallied and compared using a two-tailed
test of significance. There were
125 different statements relating to “Personality”
to be compared in these two
groups of patients. Of these 125 comparisons,
18 were found to be significant
at the .10 level or better; of these 14 were
found at the .05 level or better and 4 at
the .01 level or better. Those statements
relating to Personality in our scheme
which occurred in a significantly different
proportion (at least the .10 level) of the
medical records in the 4’3 group and the
control group are listed in Table 1.

It can be seen that the incidence of
assaultive-aggressive behavior is one of
the outstanding characteristics (80%) of
those patients who generated a 4’3 MMPI
profile, while it occurs in only 40% of
the control group of patients (p < .01).
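The comparison of 80% with 40% can be checked with a conventional test for the difference between two independent proportions. The text says only that a two-tailed test of significance was used, so the pooled two-proportion z statistic below is an assumption about the method, and the function name is illustrative. The counts (20 of 25 and 8 of 20) follow from the stated percentages and group sizes.

```python
from math import sqrt


def two_proportion_z(x1, n1, x2, n2):
    """Pooled z statistic for the difference between two independent
    proportions x1/n1 and x2/n2 (an assumed, conventional choice)."""
    p1, p2 = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    standard_error = sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    return (p1 - p2) / standard_error


# 80% of the 25 patients in the 4'3 group versus 40% of the 20 controls.
z_assaultive = two_proportion_z(20, 25, 8, 20)

# Two-tailed critical value of the normal distribution at the .01 level.
CRITICAL_01 = 2.576

print(round(z_assaultive, 2), z_assaultive > CRITICAL_01)
```

Under this assumed test the statistic exceeds the two-tailed .01 critical value, which is consistent with the significance level reported for this comparison.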

In order to illustrate the asymmetry of 
the predictive relationship between the 
4’3 MMPI pattern and the assaultive-
aggressive behavior pattern a third group
of patients was identified who had been 
admitted during the past year and were 
known by the hospital staff to have been 
assaultive or to have threatened assault. 
The period during which these patients 
were admitted overlapped for only a few 
months the admission period of patients 
whose profiles made up the original 
sample of 500 records. The medical rec- 
ords of this group were reviewed to verify 
the existence of assaultive behavior or 
threat of assault in every case. A group 
of 31 patients was located, each of whom 
had either threatened to kill or had seri- 
ously assaulted someone, and each of 
whom had generated an MMPI profile on 
admission to the hospital. The mean
profile and the limits of ±1 SD on each
scale of the MMPI obtained from the
assaultive patients are shown in Figure 3.
There was one patient in the “known
assaultive” group who generated a 4’3
MMPI profile. The D value obtained
from a comparison of the mean 4’3 profile
and the mean profile of the assaultive
group is 45.00, clearly larger than the D
of 22.00 stated earlier as the defining
parameter for inclusion in the 4’3 group.


TABLE 1

STATEMENTS IN PERSONALITY CATEGORIES FOUND SIGNIFICANTLY MORE FREQUENTLY OR
SIGNIFICANTLY LESS FREQUENTLY IN THE 4’3 PROFILE GROUP THAN IN
THE CONTROL GROUP (p = .10 or less)

               Frequency
Code      4’3 group   Control group    z       Statement
          (N = 25)    (N = 20)
0332058   20          3                4.334   Impulsive
0332081   20          8                2.949   History of hostile or aggressive behavior
0332009   7           0                2.575   Defensive
0333027   10          2                2.261   Lack of sense of responsibility
0332031   8           1                2.250   Seeks attention
0332064   8           1                2.250   Manipulative
0333043   7           [illegible]      2.121   Sensitive
0332059   5           0                2.121   Ingratiating manner
0333023   13          4                2.200   Immature
0332089   4           0                1.874   Said to be a follower
0333066   10          3                1.838   Uses projection
0332010   11          4                1.697   Demanding

0332098   0           6                2.941   Has a violent temper
0332035   1           6                2.391   Shows symptoms of anxiety and tension
0332102   0           3                2.004   Mild mannered
0332099   0           3                2.004   Felt to be dangerous
0332107   0           3                2.004   Has not threatened or injured others
0332048   2           6                1.918   Generous


[Fig. 3. Mean profile for verified assaultive patients (N = 31).]


On the basis of these data one can state
with a reasonable degree of confidence
that a 4’3 MMPI profile as defined here
implies a behavior pattern which includes
socially serious threats of assault or
actual assault. The reverse does not follow,
however, in that only a few of those
patients whose behavior pattern involves
dangerous assault generate a 4’3 profile.
In order to be able to predict all instances
of dangerously assaultive behavior we
must continue to look for additional test
patterns (P2, P3, etc.); and it is conceivable
that some of these patterns may be
obtained most efficiently with tests other
than the MMPI.

In conclusion, Holt (1958) may have
been quite right in stating that the beleaguered
clinician feels he has “been sold
up the river” in discussions of actuarial
versus clinical prediction. This may,
however, be the result of his use of inappropriate
strategy in testing his observations
regarding the validity of some of
his psychometric instruments. This unfortunate
and frustrating state of affairs
need not continue.

The increasing availability of high-speed
data-processing equipment makes
possible the critical yet relevant examina- 
tion of the predictive and concurrent 
validity of a wide variety of highly spe- 
cific patterns of responses to any of our 
psychometric instruments, provided those 
patterns can be reduced to digital form. 
There are several such actuarial pro- 
grams underway at the present time using 
the MMPI (Gilberstadt & Duker, 1961; 
Kleinmuntz, 1963; Marks & Seeman, 
1963; Pearson, 1961). In principle, how- 
ever, there is no reason why the validity 
of a number of other testing instruments 
cannot be appropriately evaluated. There 
is the possibility at least that there may be 
specific patterns of Rorschach responses 
(or even patterns of scorable DAP per- 
formances) which are predictive of some 
real-life behavior.


COMMENTS ON “A RESONANCE THEORY OF
‘MICROVIBRATIONS’”


HUBERT ROHRACHER 


University of Vienna, Austria 


Experiments pertaining to the theoretical interpretation of “microvibration”
(permanent micromovements of the human body) are
quoted which exclude any resonance effects and thus are contradictory
to Williams’ hypothesis that the frequency of microvibration originates
in resonance effects.


In his paper “A Resonance Theory of
‘Microvibrations’” Williams (1963) tries
to show,


that the periodic microvibration of the body 
surface recorded by him [Rohracher] does 
not require the assumption that there exists 
any physiological event occurring at that 
frequency [p. 547]. 


Williams proposes a “resonance theory” 
suggesting “that the frequency is deter- 
mined by the physical characteristics of 
the body-transducer system used for its 
recording [p. 547].” In an attempt to 
validate his theory Williams reports a 
frequency comparison of records pub- 
lished in my 1949 monograph (Roh- 
racher, 1949), which were obtained with 
two different types of transducers (piezo- 
electric and electrodynamic pickups). He 
finds differences between these pickup 
types (the piezoelectric pickup yielding 
frequencies in the 6.5-8.5 cps range, the 
electrodynamic one in the 9-13 cps 
range), but does not take into account 
that the two types of transducer differ 
also with regard to the size of the body 
surface they are applied to: The piezo- 
electric pickup has only point-by-point 
contact with the body surface (through 
a needle which is inserted into a phono- 
graph pickup) whereas the electrody- 
namic pickup manufactured by Philips 
has a relatively large contact surface (5 
millimeters in diameter) thus covering 
many more muscle units than the needle. 
Moreover, the records examined by Wil- 
liams come from different subjects. 
The differences in frequency, so Williams
asserts, suggest that microvibration
frequency should be regarded as an
artificial effect of the transducer-body
system, “determined by the physical characteristics
of both [p. 552].” This hypothesis
can be tested by using methods
of recording which do not involve the
attachment of any load to the part of the
body from which the record is taken.
In such a case there cannot be any resonance
because nothing is present which
could resonate. I have carried out
experiments of this type using two dif- 
ferent procedures. Of these Williams 
refers to only one: the direct microscopic 
method without any electrical transducer. 
(A diaphragm is placed on the subject's
forearm. A light beam passing through
the diaphragm is recorded and reflects
the movements of the diaphragm.) However,
Williams does not report the weight
of the diaphragm: this was only 0.6 gram.
It is difficult to imagine how so small a
weight could cause any significant resonance
effect. The second experiment
without any electrical transducer (published
in my 1949 monograph, together
with sample records and figures showing
the experimental setup) is not mentioned
in Williams’ report. In this latter case
the subject's arm rests, without any load,
between the two plates of a large
condenser: micromovements of the arm
change the capacity of the oscillating
circuit of which the condenser is part.
Both methods yielded frequencies iden- 
tical to those obtained when using an 
electrodynamic or a piezoelectric pickup. 

This would leave us only with the
possibility that, even without any load
being attached to it, the subject’s arm
itself represents a resonance system which
is activated by the pulse and thus produces
the microvibration. However, this
hypothesis has been rejected by Sugano 
(Sugano & Inanaga, 1960) in experi- 
ments in animals: if the motor innerva- 
tion of an extremity is interrupted micro- 
vibration of this extremity ceases almost 
completely although blood circulation and 
pulse remain unchanged. 

On the basis of these experiments I 
have no doubt that the frequency of 
microvibration does represent a real
parameter of an organismic process which 
takes place, permanently, in the muscula- 
ture (alternating contractions of single 
motor units). 

In recent years the author did not use 
the above mentioned two methods any 
more as they require a very complicated 
experimental setup. (When using the di- 
rect microscopic recording technique it 
is difficult to focus the light beam on the 
diaphragm. The capacitive method is 
unsuitable for comparative amplitude 
measurements because it is impossible to 
standardize the distance between the 
subject’s arm and the condenser plates— 
if only for the simple reason that there are
differences between subjects in the size of 
the arm.) Therefore the author came to 
use electrodynamic and piezoelectric 
transducers exclusively in his recent ex- 
perimental work. At present the author 
works with a very small seismic piezo- 
electric pickup weighing as little as 15 
grams, which has been developed by
Japanese investigators. 

The pickups used in my own experi- 
ments are required to have no resonance 
effect whatsoever in the microvibration 
frequency range. Therefore, for many 
years, all pickups used in my laboratory 
have been checked for their precise reso- 
nance frequencies. (A calibration appa- 
ratus consisting of a vibrating table which 
is driven by a low frequency generator 
has been constructed specifically for this 
purpose.) In measuring microvibration 
I use only transducers with a
resonance frequency which is either 
damped perfectly or lies in a frequency 
range far outside that of microvibration. 
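The requirement stated above, that a pickup's resonance be either heavily damped or far from the frequencies being measured, can be illustrated with a textbook single-mass oscillator. The following is a minimal sketch and not a model of any particular transducer: only the 15 gram mass comes from the text, while the 10 Hz test frequency, the 200 Hz resonance, and the damping ratios are hypothetical values chosen for illustration.

```python
import math


def natural_frequency_hz(mass_kg, stiffness):
    """Undamped natural frequency of a mass on a spring, in Hz."""
    return math.sqrt(stiffness / mass_kg) / (2 * math.pi)


def damping_ratio(mass_kg, stiffness, damping):
    """Dimensionless damping ratio; 1.0 corresponds to critical damping."""
    return damping / (2 * math.sqrt(stiffness * mass_kg))


def displacement_gain(freq_hz, mass_kg, stiffness, damping):
    """Steady-state amplitude relative to static deflection for a sinusoidal force."""
    ratio = freq_hz / natural_frequency_hz(mass_kg, stiffness)
    zeta = damping_ratio(mass_kg, stiffness, damping)
    return 1 / math.sqrt((1 - ratio**2) ** 2 + (2 * zeta * ratio) ** 2)


def stiffness_for(mass_kg, target_hz):
    """Spring constant that places the natural frequency at target_hz."""
    return mass_kg * (2 * math.pi * target_hz) ** 2


MASS = 0.015     # 15 g, the pickup mass quoted in the text
TEST_HZ = 10.0   # illustrative signal frequency (assumption)

# Case 1: resonance far above the signal frequency, light damping.
k_far = stiffness_for(MASS, 200.0)
c_far = 2 * 0.05 * math.sqrt(k_far * MASS)
gain_far = displacement_gain(TEST_HZ, MASS, k_far, c_far)

# Case 2: resonance at the signal frequency, same light damping.
k_near = stiffness_for(MASS, TEST_HZ)
c_near = 2 * 0.05 * math.sqrt(k_near * MASS)
gain_near = displacement_gain(TEST_HZ, MASS, k_near, c_near)

# Case 3: resonance at the signal frequency, critically damped.
c_crit = 2 * 1.0 * math.sqrt(k_near * MASS)
gain_crit = displacement_gain(TEST_HZ, MASS, k_near, c_crit)

print(round(gain_far, 3), round(gain_near, 3), round(gain_crit, 3))
```

In this sketch the gain stays close to 1 when the resonance lies far above the signal, rises sharply when a lightly damped resonance coincides with it, and shows no peak when the same resonance is critically damped.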


REFERENCES 


Marko, A. R. Optisch-mikroskopische Registrierung
der Mikrovibration des menschlichen
Körpers. Mikroskopie, 1959, 14, 102-105.

Rohracher, H. Mechanische Mikroschwingungen
des menschlichen Körpers.
Wien: Urban & Schwarzenberg, 1949.

Rohracher, H. Ständige Muskelaktivität
(“Mikrovibration”), Tonus und Konstanz
der Körpertemperatur. Z. Biol., 1959, 111,
38-53.

Rohracher, H. Permanente rhythmische
Mikrobewegungen des Warmblüterorganismus
(“Mikrovibration”). Die Naturwissensch.,
7, 1962, 145-150.

Sugano, H., & Inanaga, K. Studies on
minor tremor. Reprinted from Jap. J.
Physiol., 10, 1960, 246-257.


(Received April 27, 1964) 




A RESONANCE THEORY OF “MICROVIBRATIONS”: 
A REPLY TO ROHRACHER 


JAMES G. L. WILLIAMS 
Nebraska Psychiatric Institute, University of Nebraska College of Medicine 


It is suggested that the resonance theory adequately accounts for 
those of Rohracher’s findings which he quotes as being inconsistent 
with it and that there is at present no evidence for or need to suggest 
a relationship between the frequency of microvibrations and that of the
“alternating contractions of single-motor units.” This suggestion in
no way detracts from the importance or value of the microvibration
phenomenon itself.


Rohracher criticizes the resonance 
theory and supports his own contention 
that “the frequency of microvibrations 
does represent a real parameter of an 
organismic process which takes place, 
permanently, in the musculature (alter- 
nating contractions of single motor 
units)” by disagreeing with what he 
believes to be two critical points in the 
development of my argument (Williams, 
1963). 


First, he claims that, 


In an attempt to validate his theory (my 
italics) Williams reports a frequency com- 
parison of records published in my 1949 
monograph . . . which were obtained with 
two different types of transducer... . He 
finds differences between these pickup types 
. . . but does not take into account that the
types of transducer differ also with regard 
to the size of the body surface to which they 
are applied [p. 524]. 


This is incorrect: the limitations of the
kind of retrospective analysis made on
data obtained by scoring published reproductions
of records, the descriptions of
which were not always definitive, were
most clearly and explicitly emphasized.
Certainly no attempt to validate a theory 
was made: it was concluded only that the 
analysis did “tend to support the hypothe- 
sis that the frequency is dependent on the 
transducer assembly used [p. 551].”
(Ironically, Rohracher himself now adds 
weight to this conclusion by ascribing the 
apparent frequency differences to differ- 
ences in the dimensions of the probes of 
the two types of transducer.) Later,
however, I did describe—and here it was
as part of the validation of the resonance 
theory—a critical experiment especially 
designed to overcome the acknowledged 
limitations of this retrospective analysis 
by comparing the frequencies obtained 
from two transducer assemblies differing 
only in their resonant frequencies and us- 
ing the same probe on the same site with 
the same subjects. The differences in 
the recorded frequencies were highly sig- 
nificant (p < .001, one-tailed test) and it 
was pointed out that “This result accords 
well with the previous re-analysis of 
Rohracher’s data [p. 554].” Rohracher 
makes no reference to this experiment. 

Secondly, Rohracher describes two ex- 
periments the results of which he suggests 
are incompatible with the resonance 
theory and points out that in the 1963 
paper, I refer specifically only to the 
light-beam experiment and not to the 
capacitative one. Actually, the former
was cited merely as being typical of those 
of his “experiments which do not make 
use of electromechanical transducers to 
record the vibrations [p. 552].” The use 
of the resonance theory to explain the 
results of this particular experiment was 
offered as a specific illustration of its 
general applicability. The capacitative 
experiment would have served equally 
well to illustrate the point: in this case 
the mass, spring and damping constants 
of the capacitative pickup (as compared 
with only the last two in the light-beam 
experiment) would all be equated with 
zero and only those associated with the 
arm itself would determine the natural


or nonoccurrence of microvibrations (as, 
indeed, is confirmed by Rohracher’s own 
experiments). However, it is extremely 
important to note that if a high damping 
constant is associated with aperiodic 


high damping constant may have been 
involved. Rohracher's obtaining a micro- 
vibration wave form with a frequency 
which was “identical to those obtained 
when using an electrodynamic or a piezo- 
electric pickup [p. 525]” is inconsistent 
with his own current interpretation that 
the different pickup areas of the piezoelec- 
tric and electrodynamic transducers and, 
presumably, the condenser plates would all 
yield recordings showing different fre- 
quencies. Nevertheless, accepting these 
as identical and characteristic frequencies, 
it still does not follow that we are left 
“only with the possibility that—even 
without any load being attached to it—the 
subject’s arm itself represents a reso- 
nance system which is activated by the 
pulse and produces the microvibration [p. 
525].” And even if this were so, the sub- 
sequent statement that “this hypothesis 
has been rejected by Sugano (Sugano & 
Inanaga, 1960)” has no relevance to the 


theory: in fact, the close relationship be- 
tween muscle activity and microvibration 
amplitude was discussed in detail and 
illustrated in my 1963 paper and subse- 
quently (Williams, 1964). 

One final point: Rohracher concludes 
his criticism by observing that “The pickups
used in my own experiments are required
to have no resonance effect whatsoever [...]
of
It is maintained that not only is there 
need to postulate the existence of a 
A SUGGESTED ALTERNATIVE FORMULATION IN THE DEVELOPMENTS
BY HURSCH, HAMMOND, AND HURSCH, AND
BY HAMMOND, HURSCH, AND TODD¹


LEDYARD R. TUCKER 
University of Illinois


In the paper “Some methodological
considerations in multiple-cue probability
studies” (Hursch, Hammond, & Hursch,
1964), standardized random variables $Y_e$,
$Y_s$, $X_1 \cdots X_n$ were considered such that
$Y_e$ was the distal variable in the environment,
$Y_s$ was the subject’s estimate
of the distal variable, and $X_1 \cdots X_n$
were the cue variables.

My suggestion involves consideration 
of the variables $\hat{Y}_e$ and $\hat{Y}_s$ defined by,


$\hat{Y}_e = \beta_{e1} X_1 + \cdots + \beta_{en} X_n$
$\hat{Y}_s = \beta_{s1} X_1 + \cdots + \beta_{sn} X_n$   [1]

where the $\beta$'s are multiple linear regression
weights for predicting the variables
$Y_e$ and $Y_s$ from the cue variables
$X_1 \cdots X_n$. $\hat{Y}_e$ and $\hat{Y}_s$ are the predicted
components of the variables $Y_e$ and $Y_s$,
while the residuals $Z_e$ and $Z_s$, defined
in their Equations 4, are the errors of
linear prediction of these variables.
Their Equations 4 may be written,


$Y_e = \hat{Y}_e + Z_e$
$Y_s = \hat{Y}_s + Z_s$   [2]
$R_e^2$ and $R_s^2$ are the squares of the multiple
correlations involved and may be
determined as per their Equations 5.
From the propositions of multiple linear
regression theory,


$\mathrm{var}(\hat{Y}_e) = R_e^2$
$\mathrm{var}(\hat{Y}_s) = R_s^2$   [3]


[...] “Analyzing the components of clinical inference”
by Hammond, Hursch, and Todd.


as well as, 


$\mathrm{var}(Z_e) = 1 - R_e^2$
$\mathrm{var}(Z_s) = 1 - R_s^2$   [4]

where my Equations 4 correspond to
their Equations 6. The important point
of my suggestion is that their coefficient
$r_a$, the correlation between $Y_e$ and $Y_s$, can
be shown to have the following relation,


$r_a = \mathrm{Cov}(Y_e Y_s) = \mathrm{Cov}(\hat{Y}_e \hat{Y}_s) + \mathrm{Cov}(Z_e Z_s)$   [5]


Or, if the correlation ($\hat{Y}_e \hat{Y}_s$) = G and the
correlation ($Z_e Z_s$) = C,


$r_a = G R_e R_s + C \sqrt{1 - R_e^2} \sqrt{1 - R_s^2}$   [6]
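The decomposition in Formula 6 can be checked numerically. The sketch below is only an illustration under stated assumptions: the two cues, the weights, the shared nonlinear term, and the noise levels are invented, and the helper names (`linear_parts`, `lens_decomposition`, `simulate`) are hypothetical. It fits the two linear regressions, forms the predicted and residual components, and compares the directly computed correlation between the distal variable and the subject's estimate with the value rebuilt from G, C, and the two multiple correlations.

```python
import math
import random


def mean(v):
    return sum(v) / len(v)


def cov(a, b):
    """Population covariance of two equal-length sequences."""
    ma, mb = mean(a), mean(b)
    return sum((x - ma) * (y - mb) for x, y in zip(a, b)) / len(a)


def corr(a, b):
    return cov(a, b) / math.sqrt(cov(a, a) * cov(b, b))


def standardize(v):
    m, sd = mean(v), math.sqrt(cov(v, v))
    return [(x - m) / sd for x in v]


def solve(a, b):
    """Solve a small linear system by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))
        m[c], m[p] = m[p], m[c]
        for r in range(c + 1, n):
            f = m[r][c] / m[c][c]
            for k in range(c, n + 1):
                m[r][k] -= f * m[c][k]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][k] * x[k] for k in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def linear_parts(cues, y):
    """Split standardized y into its linear prediction from the cues and a residual."""
    beta = solve([[cov(ci, cj) for cj in cues] for ci in cues],
                 [cov(ci, y) for ci in cues])
    pred = [sum(b * c[i] for b, c in zip(beta, cues)) for i in range(len(y))]
    resid = [yi - pi for yi, pi in zip(y, pred)]
    return pred, resid


def lens_decomposition(cues, y_e, y_s):
    pred_e, z_e = linear_parts(cues, y_e)
    pred_s, z_s = linear_parts(cues, y_s)
    r_e = math.sqrt(cov(pred_e, pred_e))  # multiple correlation R_e
    r_s = math.sqrt(cov(pred_s, pred_s))  # multiple correlation R_s
    g = corr(pred_e, pred_s)              # correlation of predicted components
    c = corr(z_e, z_s)                    # correlation of residual components
    r_a = corr(y_e, y_s)                  # computed directly
    rebuilt = g * r_e * r_s + c * math.sqrt(1 - r_e**2) * math.sqrt(1 - r_s**2)
    return {"r_a": r_a, "rebuilt": rebuilt, "G": g, "C": c, "R_e": r_e, "R_s": r_s}


def simulate(n=300, seed=0):
    """Invented data: two cues plus a nonlinear term that both variables share."""
    rng = random.Random(seed)
    x1 = [rng.gauss(0, 1) for _ in range(n)]
    x2 = [rng.gauss(0, 1) for _ in range(n)]
    shared = [a * b for a, b in zip(x1, x2)]
    y_e = [0.6 * a + 0.3 * b + 0.4 * s + rng.gauss(0, 0.5)
           for a, b, s in zip(x1, x2, shared)]
    y_s = [0.4 * a + 0.5 * b + 0.3 * s + rng.gauss(0, 0.7)
           for a, b, s in zip(x1, x2, shared)]
    return [standardize(x1), standardize(x2)], standardize(y_e), standardize(y_s)


result = lens_decomposition(*simulate())
print(result)
```

The two values agree up to rounding because least-squares residuals are uncorrelated with every cue, so the cross terms between predicted and residual components vanish in the sample as well as in theory.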


Hursch, Hammond, and Hursch developed
Formula 27 (p. 55),


$r_a = \frac{R_e^2 + R_s^2 - \sum (\beta_{ei} - \beta_{si})(r_{ei} - r_{si})}{2} + C \sqrt{(1 - R_e^2)(1 - R_s^2)}$


where: $r_{ei}$ = correlation ($Y_e X_i$), $r_{si}$ = correlation
($Y_s X_i$), and


$\sum (\beta_{ei} - \beta_{si})(r_{ei} - r_{si}) = \Sigma d$,


the $\Sigma d$ form being used in the paper by
Hammond, Hursch, and Todd. The last
term in the above equation and in my
Formula 6 are identical. A difference
occurs in the form of the first term on
the right sides of these equations. These
two forms are equal, however,


$\frac{R_e^2 + R_s^2 - \Sigma d}{2} = G R_e R_s$
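The equality of the two forms of the first term follows from standard regression identities. The derivation below is a sketch rather than a reproduction of either paper's argument; it assumes the sums run over the cue index $i$ and uses the fact that each residual is uncorrelated with every cue.

```latex
\begin{align*}
R_e^2 &= \sum_i \beta_{ei}\, r_{ei}, \qquad R_s^2 = \sum_i \beta_{si}\, r_{si},\\
\operatorname{Cov}(\hat{Y}_e, \hat{Y}_s)
  &= \sum_i \beta_{ei}\, r_{si} = \sum_i \beta_{si}\, r_{ei} = G R_e R_s,\\
\Sigma d &= \sum_i (\beta_{ei} - \beta_{si})(r_{ei} - r_{si})
  = R_e^2 + R_s^2 - 2\, G R_e R_s,\\
\frac{R_e^2 + R_s^2 - \Sigma d}{2} &= G R_e R_s .
\end{align*}
```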
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These two forme of the term may TABLE 1 
be differentially useful for different z 


purposes. 
The form of Equations $ and 6 bas an 
interesting parallelism in the two terms 


involves a between com Mean 75 

ts: the first term involves the Joker 1) -M vs 763 
linearly estimated components of the wie? wn | yoo 727 
distal variable and the subject's estimate 5 af 116 — 
of the distal variable, while the second * mos a | i 


: 
s 
5 
P 
i 
+ 
i 
i 


: 


mated component of the distal variable cated.” The values of G for 
and R, is the length of the vector for the these clinicians and the median 


i 


the 

ject's response vector. Gis the cosine of following table 
ihe angle between these two linearly esti- The interesting trend in correlation 
mated component vectors. size from the Naive group to the Semi- 
of the two residual vectors are 1— RA ted group to the ted 
and I RA with the cosine of the group is statistically significant only to 
angle between these two vectors being C the extent that the correlations for the 
The direction of the vector for the Naive group are below the correlations 
linearly estimated component of the sub- for the other two groups. These results 
ject’s responses is dependent on the rela- could indicate a objective in 


sidered separately from the length meas- ponents should be determined by the 
ure, R., of the linearly predicted com- correlation of the trainees’ residual com- 
ponent of the subject's response variable. ponents with the residual components of 
Interpretive value of the correlation G IQ. A second objective of training could 
between the linearly predicted com- be to increase these latter correlations; 
ponents can be illustrated on the data however, as noted by Hammond, Hursch, 
from the study by Grebstein (1963) on and Todd, the possibilities of this case 
estimation of IQ from Rorschach psycho- should be the subject of research. . 




REFERENCES

Grebstein, L. Relative accuracy of actuarial prediction, experienced clinicians, and graduate students in a clinical judgment task. J. consult. Psychol., 1963, 27, 127-132.

Hammond, K. R., Hursch, Carolyn J., & Todd, F. J. Analyzing the components of clinical inference. Psychol. Rev., 1964, 71.

Hursch, Carolyn J., Hammond, K. R., & Hursch, J. L. Some methodological considerations in multiple-cue probability studies. Psychol. Rev., 1964, 71, 42-60.